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In this Issue 



At one time electronic design engineers only had to contend with designing 
analog circuits into their products. The only digital-like component in these 
products was the on/off switch. When digital circuits did come along, their initial 
use was limited to computation applications. Eventually, engineers realized that 
digital circuits could be used to implement many of the functions traditionally 
done in analog circuits. The result of this analog-to-digital migration is that we 
now have mixed-signal electronic products such as CD players, stereo players, 
and automotive electronics, which are designed with both analog and digital 
components. 

The primary test instrument of choice for engineers designing mixed-signal systems is the oscilloscope. 
However, because oscilloscopes have a limited number of input channels, they are not adequate for 
testing digital systems. Logic analyzers, which allow viewing several digital signals simultaneously, are 
used for testing digital circuits. While HP is a leader in providing hybrid instruments that combine oscillo- 
scope and logic analyzer technologies for testing mixed-signal systems, as pointed out in the article on 
page 6, these products are designed with full-featured logic analysis as their goal and the oscilloscope 
capability is treated as an adjunct to the analyzer. This article introduces six articles on a new family of 
oscilloscopes and logic analyzers that are targeted for mixed-signal testing, but with the opposite em- 
phasis. These instruments were designed with oscilloscope capabilities as a priority and logic analysis 
as an adjunct. 

The article on page 10 discusses an application in which the HP 54645D mixed-signal oscilloscope is used 
to simplify testing and debugging a microcontroller-based mixed-signal design. Topics covered in the other 
mixed-signal articles in this issue include the concurrent design and features of the HP 54645D mixed- 
signal oscilloscope and the HP 54645A two-channel oscilloscope (page 13), the effect of memory depth 
and peak detection on a digital oscilloscope's sample rate (page 23), the use of acquisition clock dithering 
in the HP 54546A/D to reduce aliasing and improve the display of signals that are subharmonics of the 
sample clock (page 26), the oscilloscope-like features of the HP 54620 logic timing analyzer (page 29), 
and the capabilities of the high-speed multiprocessor-based HP 54615B and HP 54616B oscilloscopes 
(page 34). 

Monitoring yeast growth in fermentation (a process associated with making beer) is one of the applica- 
tions for the HP E5050A colloid dielectric probe (page 37). The HP 5050A probe, in conjunction with an HP 
4285A precision LCR meter and an HP Vectra personal computer, provides online monitoring of colloidal 
liquid materials (e.g. water-in-oil or oil-in-water emulsions) in the food, chemical, and pharmaceutical 
industries. Key parameters of colloids are often directly related to or can be derived from the permittivity 
or conductivity of the material. The sensing technique used in the HP E5050A provides permittivity versus 
frequency characteristics, which can be used to determine the concentration, size, structure, and con- 
sistency of colloidal dispersions. 

The need for reliable networks has brought about a proliferation of test equipment for monitoring the 
health of a network. The next two articles describe network-related test products from HP's Communica- 
tions Test Division. The first product is the HP E4219 ATM network impairment emulator (page 45). This 
emulator allows telecommunication equipment service providers to emulate ATM (Asynchronous Trans- 
fer Mode) networks before deploying their equipment or services. The second article describes the HP 
E4214A Broadband ISDN UNI signaling test software (page 51). UNI (user-network interface) is the 
boundary between a private or public network and the user equipment. By providing protocol message 
decoding and encoding, the E4214A software allows engineers to develop, troubleshoot, and verify the 
features and functionality of UNI signaling. 

Once the equipment is in place to capture information about activity on the network, the next thing we 
need is some software that allows us to monitor and control network activity from a central console. The 
SNMP++ and SNMPGen software development tools (page 59) allow developers to create object-ori- 
ented network management applications quickly without having to be experts on network protocols. 
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SNMP++ is a set of C++ classes that provide an object-oriented application programming interface to the 
services of the Simple Network Management Protocol ISNMP) SNMPGen uses SNMP++ to generate 
C++ code automatically. 

Many of the circuits used in network equipment and network test instruments contain microwave com- 
ponents. A network analyzer is used to test these components. A network analyzer measures the magni- 
tude and phase of the reflection and transmission characteristics of a microwave component as a func- 
tion of frequency. This information is needed by designers so that they can verify that their components 
will work when they are inserted into a microwave system (e.g.. cellular phones, broadcast and cable 
TV, long-distance telephone transmission, satellite communications, airplane radar, and so on). Since the 
introduction of the early models of the family of HP network analyzers known as the HP 8720 family, more 
and more of these instruments are being used in production test systems as opposed to their previous 
R&D role. This change has created a need for more measurement speed and an expanded I/O capability 
for integration into production test systems. The article on page 66 describes the HP 8720D network 
analyzer, which contains features specifically targeted for the production environment. 

HP's Design Technology Conference (DTC) is an internal conference that enables HP engineers to share 
ideas, best practices, and results among HP people involved in the development and application of inte- 
grated circuit design technologies. When appropriate, we are able to publish some of the papers from 
this conference in the HP Journal. The last five articles in this issue are from the 1996 DTC conference. 
The first article (page 78) describes a project to estimate the impact of interconnect parameters on the 
performance of high-end processors (interconnect is the electrical conducting paths between devices 
or blocks on ICs) The objective was to use this information to optimize interconnect geometry. 

The next two DTC articles deal with mixed-signal modeling. In the first article (page 84), the design of a 
phase-locked loop, which is used in several HP ASICs, involved transferring the digital portion of the 
circuit to a behavioral VHDL model and creating a special behavioral model for the analog portion. This 
separation enabled ASIC designers to run system simulations. The second article of this genre (page 89) 
describes using a tool called SABER MAST for mixed-mode behavioral modeling. The tool provides a 
speedup of two orders of magnitude over traditional analog-oriented SPICE simulations. 

A combination of commercial ASIC vendor-supplied tools and some internally developed tools were part 
of a physical design methodology that enabled designers to develop 1.1-million-raw-basic-cell, 0.35-um 
gate arrays for HP Exemplar S-class and X-class technical servers (page 95). 

Finally, the ability to quickly turn around a custom IC design is very important in the IC design business. 
The article on page 104 describes how with the proper mix of new tools and methodologies, a 1.7-million- 
FET image processing chip was designed and verified in eight months. 

CI. Leath 
Managing Editor 



Cover 

Successively zoomed views of the analog startup waveform of a circuit, captured by the HP 54645D 
mixed-signal oscilloscope triggering on digital data. 



What's Ahead 

The June issue of the HP Journal will contain four articles about the PA 7300LC processor design, an 
article about the PA 7300LC-based HP 9000 B-class workstation, an article about the low-cost HP 9000 
D-class server, five articles about the DeskJet 820C printer, and four articles about testing critical soft- 
ware for medical patient monitors. 

Reminder 

Because we are getting ready for a new Journal design and focusing on other projects, we won't be 
publishing an issue in October. 
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A Family of Instruments for Testing 
Mixed-Signal Circuits and Systems 



This entirely new product category combines elements of oscilloscopes 
and logic analyzers, but unlike previous combination products, these are 
"oscilloscope first" and logic analysis is the add-on. 

by Robert A. Witte 



Electronic circuits have been a pari of modern life for so 
long thai most people lake I hem for granted. Some devices 
are inherently electronic in nature such as telephones, radio 
receivers, calculators, and personal computers. Oilier de- 
vices started out as mostly mechanical arid have gradually 
had more electronics incorporated into them over time. 
Electronics hits made its way into automobiles, cameras, 
water heaters, home appliances, elevators, thermometers 
and weighing scales. This "electronics everywhere'' (rend has 
gradually filled modern life so thai il is difficult to imagine 
what life would be like without all Ihose electrons running 
around doing their jobs. 

Most early electronic systems were analog, especially if they 
interfaced to real-world physical phenomena. The emergence 
of digital technology resulted in the gradual diffusion of digi- 
lal gales into applications I hat were once exclusively analog. 
Thus, an "analog moving to digital" trend emerged as the 
number of digital gates per acre of silicon continued to grow 
al a fast rale. Analog circuits will never be totally replaced, 
since for the most pari the real world stubbornly retains its 
analog behavior and circuits that interface to this world 
musi retain at least some analog circuitry. The result is thai 
many electronic systems are mixtures of analog and digital 
circuitry that have come to be known as "mixed analog and 
digital" or simply "mixed-signal." 

The single-chip microcontroller has emerged as an important 
component in these mixed-signal designs. Of course, micro- 
controllers have been around for decades, doing Ihe lowly 
control tasks in cost-sensitive applications while their more 
powerful siblings ( "real" microprocessors such as an 
Intel80486 or a Pentium") got all of Ihe attention, usually 
because of their critical role in personal computers Mean- 
while, Ihe single-chip microcontroller improved in perfor- 
mance, moving from 4 bits 10 S bits and lfi bits while also 
improving in cost-effectiveness. Without much fanfare, these 
single-chip devices found their way into a wide range of 
designs, causing one author to refer to them as "The intimate 
ASK'."' These devices are often used lo control or measure 
a physical system (e.g., anlilock braking, camera control, 
appliance controls, industrial control systems). A generic- 
block diagram for such a system is shown in Fig. 1. and an 
example of a mixed-signal single-chip microcontroller is 
presented on page 8. 

This increased use of mixed-signal electronics is showing up 
in a wide variety of industries and applications. Consumer 



electronics is an obvious area, w ilh mixed-signal designs 
being used in CD players, stereo receivers, tape decks, and 
camera eleclronics. Similarly, communications devices such 
as modems, telephone answering machines, and multimedia 
boards for PCs all use mixed-signal eleclronics. There are 
many applications in industrial electronics, including process 
control, industrial water heater controls and other sensor- 
based systems. The growing area of mechaironics ( the 
merger of mechanical and electronic technology) is primarily 
mixed-signal in nature. A large and growing mixed-signal 
area is automotive electronics, including subsystems such 
as the previously mentioned antilock braking systems 
and ignition control, Biomedical applications are another 
emerging area, with mixed-signal electronics being applied 
in pacemakers, heating aids, and other medical devices. 

Systems can be totally digiial if only on-off control is 
required. More likely, there is some physical quantity being 
measured or controlled that requires a portion of Ihe elec- 
tronic system to be analog. The increased use of sensors 
allows engineers to create smaller control systems that 
monit or physical events and do a better job of controlling 
the system. A good example of this is anlilock braking 
systems in automobiles. In this case, electronics (including 
sensor technology) is being used lo make the braking effec- 
tiveness of ihe automobile far better than would be possible 
in a purely mechanical system. 

Oscilloscopes 

For designers of mixed-signal systems, Ihe troubleshooting 
tool of choice is ihe oscilloscope. The versatility of ihe oscil- 
loscope for viewing a wide variety of waveforms allows tin- 
design engineer to see what's happening in the device under 
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test Other test instruments may also be used but the oscillo- 
scope remains the first tool that most users turn to for de- 
bugging circuits. 

Mixed-signal engineers expect many things from their oscillo- 
scopes. The oscilloscope must have sufficient fundamental 
performance in terms of bandwidth and sample rate to cap- 
ture and display signals accurately and reliably. At the same 
time, the oscilloscope must be easy to use so that the engi- 
neer can focus on the operation of the circuit and not on the 
operation of the oscilloscope. In the heat of the trouble- 
shooting battle, it is very distracting for the user to have to 
interrupt the debug process to deal with an uncooperative 

test instrument 

With the increasing digital content in these mixed-signal 
systems, a two-channel or even a four-channel oscilloscope 
quickly mns out of inputs. It is just not possible to view the 
complete operation of even a simple digital circuit with a 
conventional oscilloscope. Take the trivial example of a 4-bit 
up/down counter. Willi lour output lines, a clock input, a 
load input, and an up/down count control, the counter will 
hav e al leasl sev en digital signals associated with it (and 
perhaps more I. Since an engineer cannot view all of these 
signals at once with an oscilloscope, the usual remedy is to 
apply "mental storage." The engineer applies the oscillo- 
scope probe to a particular signal, views it. and stores the 
image into mental memory, then moves to other signals of 
interest and repeats the process. Eventually, a picture of the 
entire circuit's operation is formed in the mind of the debug 
artist. Sometimes this picture is precise and meaningful but 
sometimes it is cloudy and full of uncertainty; 

Logic Analyzers 

The idea behind a logic analyzer is I hat many digital signals 
can be viewed simultaneously so that an engineer can obtain 
;i complete anil accurate picture of what is real I v going on in 
a circuit. One of the trade-offs made in a logic- analyzer is 
that only digilal signals can be v iewed and only an idealized 
reconstmclion of the waveform is possible. That is, the wave- 
form is treated as a purely binary signal and is displayed as 
either a logic- high or a logic low. w itli no detail of the actual 
waveform shape. This is a reasonable compromise since the 
major problem in a digilal system is determining whether 
the circuil Operation is correct. It is a functional check that 
does not rei|ttire the waveform detail. In fact, il can be argued 




tliat complete waveform detail is mostly visual noise when 
checking the functionality of a digital circuit. 

As logic analyzers evolved, they tended to be optimized for 
solving the really tough problems that come with complex, 
bus-based microprocessor systems. Most modem logic ana- 
lyzers have a minimum of 32 channels and may have 128 
channels or more. They also provide extensive trigger and 
selective storage features that make them unmatched iti 
debugging and troubleshooting power. This power inherently 
leads to a certain level of complexity in the analyzer com- 
plexity that can be a barrier to the oscilloscope-oriented 
engineer. Many designers of mixed-signal systems, even 
when limited by the number of channels on their oscillo- 
scopes, remain reluctant to adopt logic analyzers. 

Mixed -Signal Oscilloscopes 

Further investigation into this avoidance of logic analyzers 
revealed the opportunity for a new kind of lest instrument 
that would expand the channel count of the oscilloscope 
without losing its inherent appeal. A series of market re- 
search activities were launched to determine how to fill this 
gap between oscilloscope and logic analyzer. Clearly, engi- 
neers were limited by the number of channels on their oscil- 
loscopes Inii ihi'v were not always adopting a logic analyzer 
a.- the solution. What eventually emerged is an entirely new 
product category that combines elements of oscilloscopes 
and logic analyzers. 

As a world leader in logic- analyzer products. HP had already 
pioneered the creation of hybrid instruments that combine 
oscilloscope and logic analyzer technology ( III* 1660 Series 
and IIP l(i">0() Series logic analyzers, for example).- However, 
these products were designed with full-featured logic analysis 
as their lop priority and the osc illoscope capability is an 
adjunct to the analyzer. The new product line starts with the 
oscilloscope user in mind. These products are "oscilloscope 
first" and logic analysis is t tic add-on 

A family of IIP products aimed al mixed-signal testing has 
been created. The HP 541)00 product line now offers the 
following sei of test instruments appropriate for mixed- 
signal work: 

• The IIP . r )4li4"il) mixed-signal oscilloscope (Fig. 2) combines 
two 100 -MHz oscilloscope channels with 10 logic liming 
channels. This is the ultimate mixed-signal testing tool 




(a) (b| 

2. Tlie HP 64645 A Wo»<*annel oscilloscope oo and the HP M645I) mixed-signal oscilloscope tin. both »iiii MegaZnnin, are examples 

Of the new product Category of mixcd-signal oscillciMupcs 
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Mixed-Signal Microcontroller 



As explained in the accompanying article. HP's new family of mixed-sig- 
nal test instruments is designed to meet the needs of designers of prod- 
ucts that are partly analog and partly digital, such as antilock braking 
systems, camera control systems, appliance controls, and industrial 
control systems. Many of these products are based on single-chip micro- 
controllers. The producers of these products are demanding simpler and 
cheaper electronic assemblies from system developers, and this pressure 
to reduce costs is fed back to microcontroller suppliers in a number of 
ways, most of which can be summarized as a greater integration of 
mixed-signal peripherals with the microcontroller core. Thus, the micro- 
controllers themselves are becoming mixed-signal devices. 

For example, Microchip Corporation's PIC14000 microcontroller inte- 
grates a number of peripherals that are often requested by system de- 
signers. This peripheral set, which is well-suited for slow-moving real- 
world analog signals, is packaged with Microchip's 14-bit microcontroller 
core to create the PIC14000. The peripherals are: 
1 Single-slope analog-to-digital converter (ADC) 
1 16-bit programmable timer with capture register 
16-ms maximum conversion time at maximum resolution with 4-MHz 
clock 

4-bit programmable current source 
8 external channels, two with selectable level-shift inputs 
6 internal channels 
i On-chip temperature sensor 

Two comparators with programmable references 
i Internal bandgap voltage reference 
Voltage regulator control output 
On-chip low-voltage detector. 

Wes Reid 

Senior Applications Engineer 
Standard Microcontroller & 
ASSP Division 
Microchip Corporation 
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Fig. I. Block diagram of the PIC14000 mixed-signal microcontroller. 



because it seamlessly integrates oscilloscope channels and 
logic timing channels. This product includes HP's Mega- 
Zoom technology, which delivers the benefits of a million- 
sample memory for each channel without the agonizingly 
slow responsiveness normally associated with deep memory. 

• The HP 54645A two-channel oscilloscope with MegaZoom 
{Fig. 2), a general-purpose oscilloscope, is the first afford- 
able 100-MHz digital oscilloscope with sufficient sample 
rate and memory depth to capture a wide range of analog 
and digital signals. The HP 54645A has the same basic Mega- 
Zoom technology as the D version but does not have the 
logic channels. 

• The HP 54620A/C 16-channel logic timing analyzer is an 
easy-to-use logic analyzer designed to be the perfect com- 
panion product to an oscilloscope, perhaps the one that the 



user already owns. This logic analyzer has an oscilloscope 
user interface metaphor so that it is familiar and easy-to-use 
for the typical oscilloscope user. 
. The HP 546 15B and 54616B/C 500-MHz two-channel oscillo- 
scopes are general-purpose oscilloscopes that provide the 
bandwidth and sample rate ( 1 GSa/s and 2 GSa/s, respec- 
tively) needed for high-speed digital wwk while maintaining 
the responsiveness and trustworthiness required for mixed- 
signal debugging and troubleshooting. The HP 54616C has a 
flat-panel color display. 

While each of these products represents a particular mix of 
features and performance, they all share the core values of 
the HP 54600 product line: 
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Direct Access ( ontrols OH Main Functions. The vollsdivi- 
sion. time/division, position, and trigger level controls are 
all dedicated knobs for easy access. 

Responsive User Interface. Instant response to front-panel 
changes is critical to making an oscilloscope easy to use. 
All of these products provide sufficient processing power 
to maintain a responsive front panel. 
Fast Update Rate and Low Dead Time. These products use 
multiprocessor arcliitecrures to make sure the display is 
updated quickly. 

Digital Peak Detection. These products have true digital 
peak detection, which is available on all sweep speeds. 
Digital peak detection keeps the sample rate operating at its 
maximum Specification even on very slow time base settings, 
guaranteeing that short -duration events (such as glitches) 
are not missed (see article, page 23). Peak detection also 
prevents aliasing. On the HP 54620A/C logic analyzers, the 
equivalent feature is glitch detection. 
HP Proprietary Alias-Reduction Techniques. All of the HP 
54600 Series oscilloscopes employ HP's proprietary alias- 
reduction techniques to reduce the possibility of an erro- 
neous displayed waveform (see article, page 2(i). 



• Compact and Portable. These products pack a lot of 
measurement power into a compact size and the small 
footprint helps save precious bench space. 

• Optional HP-IB. RS-232. and Parallel Interfaces. These 
products offer a choice of computer and printer interfaces. 

• Optional Measurement/Storage Module. This option 
provides FFT. advanced math, and pass/fail mask testing 
(oscilloscope products only). 

The accompanying articles in this issue describe some of the 
key design contributions incorporated in these products. 
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Testing a Mixed-Signal Design Based 
on a Single-Chip Microcontroller 

The HP 54645D mixed-signal oscilloscope simplifies the testing and 
debugging of microcontroller-based mixed-signal designs with its 
integrated analog and digital channels. 

by Jerald B. Murphy 



This article presents an example that shows how the HP 
54045D mixed-signal oscilloscope (see article, page (5) might 
be applied to understand the operation of a device that pro- 
duces a 60-Hz reference signal that is used by other devices 
in a system. The reference sine wave is produced by low- 
pass filtering a pulse width modulated signal generated by a 
PIC16C74 8-bit CMOS singlo-cliip microcontroller (see block 
diagram. Fig. 1). This is a fairly typical application of these 
powerful, low-cost devices. The low cost and availability of 
single-chip microcontrollers has resulted in their application 
in areas that previously were nonelectronic, and in some 
cases even mechanical. Examples include the replacement 
of clockwork-driven appliance switches and motor control. 

In this application we have a very stable signal being pro- 
duced With a simple design. An analog design with the same 
level of performance would have been much more complex. 



The pulse width modulator (PWM) of the PICH3C74 micro- 
controller is controlled by an 8-bit input. This input is pro- 
vided by a lookup table stored in the microcontroller's 
memory. In operation the microcontroller sends the first 
control input to the PWM from the lookup table, waits a 
specified time, and then sends the next value in the lookup 
table to the PWM. When the last value in the lookup table is 
sent to the PWM. the microcontroller's counter is reset to 
stall the process again at the top of the table. The input to the 
pulse width modulator is available on the microcontroller's 
port B RB0-RB7 data lines. 

In this application we can view the PWM's output and the 
8-bit data that produced that output. This will enable us to 
verify the correct operation of the PWM. 



PIC16C74 Microcontroller 



Address Bus 



Clock 
Oscillator 




Address 



1 



Duly Cycle 
Registers 



Timing 
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Pulso Width 


RC1/0SI 


Generation 




Modulator 







Port B to HP54M5D Logic 
Channels 0 to 7 



HP54645D Analog 
Channel A1 



Low-Pass 
Filter 



HPS4645D Analog 
Channel A2 



Analog 
Sine Wave 
Output 

Fig. 1. Block diagram of the PIC16C74 microcontroller and low-pass filter implementing a reference sine wave generator, drawing the 
hookup to the HP 54645D mixed-signal oscilloscope. 
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Fig. 2. Glilch triggering is used in locate the minimum pulse width 
of the pulse width modulator of ihe PICIfH'7-l micro, unt roller 



Fig. 4. A mixed-signal measurement. The delay from digital control 
In pulse width output is measured to be 8_2 \is. 



The firsl Step in lesling this design is lo verify the operalion 
of the PWM. We want to make sure that the niiniiniim pulse 
width is correct and the maximum pulse width is less than a 
period of the system's basic liming. The minimum pulse width 
Ls found by using the HP 5-J645D's glitch triggering mode. 
We simply use one of I lie analog channels lo probe the PWM 
and then select the "less I han" mode of the glitch trigger to 
find the minimum pulse width. 

The minimum pulse width should be approximately 1 (is, so 
we set thai as the search condition in the gliich trigger. Now, 
decreasing this value until the mixed-signal oscilloscope 
slops triggering (which happens immediately) isolates the 
minimum pulse width (Fig. 2). Applying the mixed-signal 
oscilloscope's automatic measurement feature gives a mea- 
surement Of this puke w idth as 1.02 us. The maximum pulse 
width can he located in I he oscilloscope's deep memory by 
delaying the sweep by 8.3 ms ( half the period of the tiO-llz 
signal l. Here we observe a pulse that is 2">(i us wide with 




— 06-015 i Reorder 
Off On DI5 - X 



clean transitions on either side. By probing the microcon- 
troller's port I! data lines we can observe the input and out- 
put of the PWM. The deep memory of the mixed-signal os- 
cilloscope greatly simplifies the task of observing this 
operation. Simply pressing the Autoscale key results in a dis- 
play thai contains both the analog channel displaying the 
PWM output and the eight logic channels displaying the 
PWM's input. The resulting display is nol well triggered but 
the waveform data needed lo observe the PWM operation is 
captured in the oscilloscope's deep memory. Simply pressing 
the Run/Stop key freezes the display (Fig. 3) so thai the con- 
tents of the oscilloscope's deep memory can be examined by 
panning and zooming, using the oscilloscope's time base 
controls. We see that the minimum and maximum values of 
the pulse width are related to the (II) anil FF values of the 
digital inputs. The display is zoomed in on a leading edge of 
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Fig. 3. \ mixed-signal display <>f the pulse width modulator OUtpUl 
thannel AJ and the digital control Hues n-7 irbo in RB7 In Fig- I) 



Trigger Soiree 

111 E9 DO bltOO Line 



Fig. 5. a view of the total operation of this mixed-signal system. 
The pulse width modulator output is channel Al. the mil put of 
the low-pass niter is channel A2. and digital channels 0 7 are tin 

pulse width modulator control lines. 
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Fig. 6. Cursor 1 1 is placed at the point where the digital control 
lines enter thr state FF. 

one of the PWM pulses and ihc delay from the digital input 
to the PWM output is measured to be 8.2 us (Fig. 4). We can 
c onclude that the PWM is operating properly. 

The next set of questions relate to the low-pass filter. 
One question might be, "What is the delay from the digital 
input of the PWM to the output of the low pass filler?" This 
measurement is easily accomplished with the HP mixed- 
signal oscilloscope. Connect a second oscilloscope probe 
lo the output oflhe Biter and press Autoscale. The resulting 
mixed-signal display contains the digital input to the PWM, 
I he v ariable-width pulse output of the PWM. and the output 
of the low-pass filter. The oscilloscope is now triggered on 
the rising edge of the sine wave output oflhe low-pass filter. 
The total operation of the system under test is now displayed 
on one screen ( Fig. 5). 

The delay from the digital input of t he PWM to the output of 
the low -pass filler can be measured by first placing a cursor 
on a known value of the digital input lines. After selecting 
the hexadecimal cursor readout mode, the tl cursor can be 
placed at the point where the digital lines first reach the value 
of FF 1 (Fig. (J). The t2 cursor is then placed at the maximum 
value of the sine wave. Switching the cursor readout to lime, 
this delay is measured to be 2. fill) ins (Fig. 7). This measure- 
ment was made without having lo take into consideration 
any additional delays that might be associated with the 
instrumentation. The deep memory of this product allows 
this measurement lo be made on data that was gathered on 
one trigger. There was no need to be concerned about errors 
that might be produced by having the display made up of 
many cycles of system operation. 



Fig. 7. With cursor t-2 placed CUI 'he peak of the low-pass filter out- 
put and the cursor readout changed to time, the delay from digital 
command to output is measured to he 2.(510 ins. 

In this example we have seen how the IIP 54645D mixed- 
signal oscilloscope simplifies the testing and debugging of 
microcontroller-based mixed-signal designs with its inte- 
grated analog and digital channels. The user oflhe HP 
mixed-signal oscilloscope has the advantage of MegaZoom 
technology's high-speed display, instantly responding con- 
trols, and deep memory to simplify the testing phase of the 
project The integrated analog and digital channels produce 
an easy-to-uiiderstand view of the device under test. The 
glitch triggering was used to isolate a point of interest. In 
this example there was little need to use any of the mixed- 
signal oscilloscope's pattern or advanced pattern triggering 
capabilities. These features provide even more debugging 
power when needed. Both CUIBOr and automatic measure- 
ments were used to characterize the device under lest. 
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Design of a Mixed-Signal Oscilloscope 



This combination of a digital oscilloscope and a logic timing analyzer 
provides powerful cross-domain triggering capabilities for capturing 
signals in mixed-signal environments. MegaZoom technology, consisting 
of advanced acquisition techniques and dedicated signal processing, 
maintains display responsiveness while making optimal use of deep 
sample memory. 



by Matthew S. Holcomb, Stuart O. Hall. Warren S. Tustin. Patrick J. Burkart. and Steven D. Roach 



The design of the HP 54645A/D oscilloscopes introduced in 
the article on page 6 began with the HP 54645A. envisioned 
as a performance upgrade to the HP 54600 Series oscillo- 
scopes. 1 These oscilloscopes, introduced in 1991, have an 
almost analog look and feel. Their three-processor design 
yields unprecedented display update rate and responsiveness 
at an affordable price. The major design goal for the HP 
54645A was to improve I he sample rale performance by an 
order of magnitude while maintaining the responsiveness 
and display updale rate of the existing HP 54600 Series 
products. 

Ultimately, two new products were created: Ihe HP 54645A 
two-channel oscilloscope and the HP 546451) mixed-signal 
oscilloscope. The mixed-signal oscilloscope is a new product 
category thai adds 16 logic timing analyzer inputs to the two 
channels of oscilloscope input. In addition to displaying all 
16 logic channels, the HP 54(5451) provides advanced logic 
triggering functions on patterns that can span all 18 channels. 

The HP 51645A and 516151) were designed concurrently. 
We made every effort to have the oscilloscope-only product 
(the HP54645A) be simply the combination product (Ihe 
HP54645D), wilh an external trigger circuit substituted for 
Ihe digital channel subsystem. Even the firmware ROM in 
the two products is identical. 

Architecture 

We started the design by modifying the architecture. A sim- 
plified hlork diagram of an HP 546011 Series oscilloscope is 
shown in Fig. U Two ICs form the core of the system: the 
acquisition processor and the display processor. The display 
system was left alone and only the acquisition circuitry was 
redesigned. We kept the same display, the same package, the 
same analog front end. and the same host 68000 processor. 

In the original HP 54600 design, the acquisition processor IC 
was responsible for taking Ihe digili/.ed samples from the 
analog-to-digital convener (ADC) and placing them into an 
external memory as time-ordered pairs, which the display 
processor IC placed on the CRT display. More specifically, 
the acquisition tasks include: 

Cen. 'ration of the sample clocks for Ihe ADCs and control- 
ling ihe decimation of ihe sample clock and dither algorithms 
for use al slower lime base settings. 

Peak detection of the ADC data. This block calculates Ihe 
minimum and maximum values of the sampled data. 



• Intermediate storage of the ADC" data ( or minimum/maxi- 
mum pairs if the peak-detector is used ) into an internal circu- 
lar 2K-sample memory. This memory, known as the rapture 
RAM. holds the data until the trigger point is calculated. 

• Accepting the analog trigger from one of the channels and 
measuring the lime between the trigger and one of the 
sample clock edges. 

• After the trigger is found, unloading the data from the 
capture RAM into an external RAM known as the wavqform 
RAM. For each sample value, a corresponding x-axis (time) 
value is calculated. 

All of these tasks were integrated into one c hip for Ihe HP 
54600 oscilloscopes. For the new products, we divided the 
above functions into separate, discrete components, as 
shown in Fig. 2. 

Clock Generation. Much of the difficulty of the original one K 
design stemmed from unwanted coupling belwecn the sam- 
ple clocks ;uid Ihe trigger clocks. In a digital oscilloscope, 
such coupling can severely corrupt the time base fidelity, 
causing time-axis nonlincarities ("bowing in liine"), lime-axis 
discontinuities, and sample bunching. In a higher-frequency 
instrument, the design of the clocking and trigger systems 
would have been all the more difficult. Consequently, Ihe 
new products have a separate, dedic ated bipolar IC for han- 
dling ihe clock generation and triggering. 

Peak Detection. As before, we needed a handful of digital 
circuitry that stands between the ADC and the intermediate 
capture memory. This circuitry lakes the 200-MSa/s 8-bit 
ADC data, calculates the appropriate minimum And maximum 
values, and stores Ihe results sequentially into memory. 

Additionally, we decided to improve Ihe averaging perfor- 
mance by digitally low-pass filtering the incoming sampled 
data before storing it into memory. This technique (later 
named smoothing) requires summing .".12-bit values at a 
200-MHz rate. 

We chose to realize these functions in a CMOS gale array 
known as the oscilltiscopc sh/itnl pivi cssar H '. In addition 
to the functions described above, this IC decelerates (or 
fans out ) Ihe ADC data, steering and decimating Ihe sam- 
pled dala into the capture' memory as desc ribed next. 

Capture Memory. Recall that this memorj is used to hold the 
incoming data until a suitable trigger point has been found 
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Fig. 1. Original UP 54600 Serins oscilloscope block diagram. 

and measured. In the HP 54600, the acquisition processor, 
being a fully custom design, had an internal 2K-byte (16K-bit) 
memory. Such memories are less available in gate array 
technology, so combining this function with the oscilloscope 
signal processor IC wasn't a particularly attractive option. 

At 200-MHz rates, or 400-MHz rates for the timing channels, 
high-speed memories aren't off-the-shelf plug-in items. A 
traditional ECL design would be power hungry and costly. 
Our solution was to use commercially available video 
memories (VRAMs). Since these parts have a 50-MHz 16-bit 
interface, we could store data at a 100-Mbyte/s rate per 
memory. Each channel then requires only two devices. Since 
these memories are dual-ported, we can access the data (for 
plotting) simultaneously with the acquisition process, an 
advantage that we capitalized on further, as explained later. 

Each VRAM has a capacity of 4M bits, resulting an SM-bit 
(or lM-byte ) memory depth per channel. This increased 
memory depth is a huge improvement over previous designs 
and delivers value to the customer through increased time 
capture at a given sample rate (see article, page 23). 



Time Base and VRAM Controller. Along with the advantages of 
all of this deep memory came the problems of deep memory. 
Another CMOS gate array IC was designed for managing the 
storage and ret rjeval of data into the VRAM array. Its primary 
job is to provide the necessary commands to the VRAM to 
manage the data flowing into and out of its two pons. It also 
provides a high-speed interface to this memory for the rest 
of the system at a sustained reading or writing rate of over 
10 million bytes per second. It controls the main time base 
algorithms, the trigger delay and holdoff counters, and a 
variety of interacquisition and intra-acquisition sample 
dithering circuits. 

Data Processing. In previous products all of the time-axis 
acquisition algorithms were hard-coded into hardware. 
Such a system was fast and inexpensive, but more difficult 
to change. Therefore, for the new oscilloscopes, instead of 
hard-coding the mathematics into the time base and VRAM 
controller IC. we opted to use an off-the-shelf digital signal 
processor, or DSP: Texas Instruments' 25-MHz TMS320C50. 
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Fig. 2. HP 546454 osrilloscope architecture Compared to Che HP 54000, The topology remains relatively unchanged. Only tlie logic 

partitioning is different. 



Like its predecessor, the DSP's job is to use the trigger infor- 
mation to determine where each of the samples stored in 
the VRAMs should be placed on the screen. Additionally, it 
handles all of the pan-and-zooni operations, allowing the user 
to redisplay the acquired data at other lime base or delay 
settings after the acquisition is complete. 

Mixed-Signal OscUloscope 

The idea behind the HP 54045D mixed-signal oscilloscope 
is to provide a truly seamless analog and digital waveform 
viewing instrument. A key goal for the project was to pro- 
vide a product that was greater titan the sum of just the two 
instruments. By combining an oscilloscope and a timing 
analyzer, we hoped to provide functionality and ease of use 
never before available in a laboratory quality oscilloscope. 

Careful attention was paid to knitting together the analog and 
digital halves of the mixed-signal oscilloscope. Anything you 
can do with the analog channels, you can do with the digital 
channels: viewing, automatic measurements, triggering, and 



so on. Both halves share the same time base and the trigger- 
ing system can operate on all 18 channels. 

Where the oscilloscope channels have a osrilloscope signal 
processor CMOS gate array to decelerate and postprocess 
the ADC data, the digital channels have a similar logic ana- 
lyzer signal processor CMOS gate array. However, the tech- 
niques for storing and plotting timing analyzer traces are 
radically different from oscilloscope traces. More specifical- 
ly, on the faster time base sett ings, oscilloscopes use a tech- 
nique known as equivalent lime sampling to build up a dis- 
play record over multiple triggers, resulting in an effective 
sample period of 50 ps or less (2.5 ps for the HP 54645A). 
Timing analyzer traces, however, are always single-shot — an 
entire waveform is reconstructed from each and every trig- 
ger, with an effective sample period exactly equal to the 
actual sample period. We therefore chose to double the max- 
imum sample rate on the logic channels to 400 MSa/s to 
provide better single-shot time resolution. Using the same 
50-MHz 16-bit VRAM memory as the HP 54645A, the HP 
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541)451) lias a total memory capacity of tWO million samples 
for each of eight digital channels (sampled at 400 MSa/s_) or 
1 million samples across all 16 digital channels (sampled at 
200 MSa/s). 

Use of Deep Memory 

It has long been recognized that one of the factors slowing 
the acceptance of the digital storage oscilloscope over the 
traditional analog oscilloscope has been the superior update 
rate of the analog oscilloscope. Update rate describes the 
number of waveforms that the instrument can display per 
unit time. Since update rate is representative of the percent- 
age of the input waveform thai actually gets displayed 
on the screen, it translates to the amount of information 
relayed to the user. In the past, digital storage oscilloscope 
manufacturers used a single-processor architecture to handle 
all user interface, acquisition, and display tasks. This has 
resulted in digital storage oscilloscopes gaining a reputation 
lor having far inferior update rale capability compared to 
analog oscilloscopes. 

i )ne nf the advantages that digital oscilloscopes hold over 
analog oscilloscopes is their ability to store the data for 
closer examination after the acquisition is stopped. Digital 
oscilloscopes with deep memory have been gaining popular- 
ity because of their ability to store a given amount of time 
with more timing resolution than oscilloscopes with less 
memory. The deeper an oscilloscope's acquisition memory, 
the more samples per unit lime il can capture. To make full 
use of the deep memory, digital oscilloscopes provide the 
capability atpCMtoing { moving the display window earlier or 
later in time) and zooinintj (changing the display window to 
display more or less time). This allows the user to display 
the portion of the acquisition that is of interest, with the 
desired amount of detail. 

1 ii fortunately, the update rate of the oscilloscope normally 
Suffers with deep-memory acquisition architectures. The up- 
date rale is inversely proportional to the amount of memory 
captured. Since every data point processed has to be read 
out of an ADC convener and placed in capture memory by 
the acquisition system, more points translates to more lime 
to process those points. The deep-memory digital storage 
oscilloscopes on the market before the HP 54(545A/D always 
place the update rate/memory depth trade-off in the hands 
of the user by means of a human interface control for the 
acquisition memory depth. If the user is interested in pan- 
ning anil zooming the acquired data after stopping the acqui- 
sition, the memory depth can be increased to gain more 
timing resolution for this task. If the user is only interested 
in \iewing repeated acquisitions of the signal, the memory 
depth can be decreased to improve the update rate. This 
requires that the user have an understanding of the trade- 
offs involved and a knowledge of the user interface so thai 
the changes can be made. 

The HI 1 64645A/D oscilloscope architecture is designed to 
optimize the memory depth for the user's requirements. If 
the user is viewing repeated acquisitions of the signal, the 
memory depth is decreased if necessary to maintain the 
maximum update rale. If the user slops the acquisition, the 
memory depth of the last acquisition is changed to use all of 
the available acquisition memory'. This feature is made pos- 
sible by the same architectural feature that was designed to 
maximize the oscilloscope's update rate. During most 



continuously acquiring configurations, half of the acquisi 
lion memory is used to write newly acquired daia while the 
Other half is being read into the display. Since the architec- 
ture is fundamentally based on a 1-million-point memory 
system, 500,000 points are dedicated to the next trigger 
while 500,000 points are being read to the display. 

If memory' depth has been traded off for update rate, a subset 
of 500.000 points is used for each trigger during continuous 
acquisition mode. For example, at 200 us/div, the acquisition 
time required to fill the screen with an acquired waveform is 
200 us/div x 10 divisions = 2 ms. The time required to ac- 
quire 500,000 points at 200 MSa/s is the sample period limes 
the number of points, or 5.0 x 10 -9 x 500.000 = 2.5 ms. Since 
the lime required lo capture 500.000 points is larger than the 
lime required lo fill the screen, we choose lo reduce the 
number of acquired points lo maintain maximum Update 
rate. In this case. 400.000 points can be acquired in 2 ms 
(400,000 x 5.0 x lfr fl = 2 ms), so we acquire 400,000 points. 
This maintains the theoretical update limitation of 500 wave- 
forms per second (1/0.002 = 500). A more extreme trade-off 
occurs at 5 us/div. Only 10,000 points can be captured in the 
50-us acquisition lime ( 10,000 x 5.0 x 10"" = 50.0 x Ifr 6 ). 

If the user presses the Run/Stop key wiiile an acquisition is in 
progress, the assumption is made that this may have been 
done to examine the last acquisition in more detail. The lasl 
acquisition that occurred before the user's action is stored in 
one half of the memory system. Since it cannot be assumed 
thai more triggers will follow the user's action, this half of 
the memory may not be written to any more after the Run/ 
Stop key is pressed. However, since the other half of the 
memory system now contains defunct data, the acquisition 
system is reconfigured to use the entire 500,000 points avail- 
able in this other memory half, and keeps acquiring data, 
looking for a trigger. Since the primary task to be executed 
when the user presses the Run/Stop key is to stop acquisition 
in what appears to be an instantaneous fashion, the system 
cannot wait forever for an additional trigger. Therefore, a 
timeout is initiated that will cause the system to cease look- 
ing for additional triggers after 100 ms. For the oscilloscope's 
acquisition system, this is a large window of opportunity to 
capture a deep-memory record. However, if no additional 
triggers occur, the acquisition system is brought to a stopped 
state, and the shorter acquisition obtained on the last trigger 
is used for post acquisition pan and zoom. Since most appli- 
cations involve relatively frequently occurring triggers, this 
methodology will result in most users viewing deep-memory 
acquisitions every time they stop their oscilloscopes. For 
applications in which t he rate ai which triggers occur is un- 
certain, or in which there is only one trigger event, the user 
can press the Single key and guarantee a 1-million-sample 
acquisition for every trigger. 

The Decimation Problem 

Even with a million points of sample memory, at the slower 
lime base settings, there will still be more samples across 
the screen than can be stored. We need to prune (decimate) 
the available data down to the memory' size, then further 
thin the stored data down into 4000-point records for rapid 
display. In these situations, when the ADC's are sampling 
faster than the data can be used, the oscilloscope is said to 
be oncrsampling. 
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Fig. 3. Various decimation techniques shown at 100:1 oversainpling. 
(a) ADC samples, (b) Simple decimation, (c) Intra-acquisition dither, 
(d) Peak detection, (e) Smoothing. 

There are a variety of tec hniques available to perform this 
decimation process, each portraying a different representa- 
tion of the input signal. Some methods will tend to highlight 
the subtle variations in the signal, others show the signal 
extremes, and others hide the signal extremes. As an exam- 
ple, suppose that the sample rate of the ADC is a factor of 
100 greater than the desired store rate. Fig. 3 shows various 
decimation techniques for 100: 1 oversampling. 

Simple Decimation. I >nc sample is stored and the next 99 
ignored; I hen the next sample is stored and the next 99 
ignored, and so on. This approach is illustrated in Fig. 3b. 
Since il is the simplest to implement, it is the most common 
decimation technique in digital oscilloscopes. Unfortunately, 
because of the very regular spacing of the stored samples, 
il is exceptionally prone to aliasing. Therefore, the HP 
54645A/D rarely use this technique (only when calculating 
FFTs. when exact sample spacing is called for). 

Intra-Acquisition Dithering. Rather than Store lite fitSl sample 
during the sample interval, this patented technique stores a 
randomly selected sample for each interval. In Fig. 3c, sample 
#33 is stored during the first interval, then #69 during the 
second, then #2 in the third, and so on. This technique, used 
in all of the 546xx oscilloscopes, is remarkably effective 
against aliasing. The stored samples are not evenly spaced, 
so it becomes much more difficult for the sampled signal to 
"lock" to the samples. The HP 54645A/D oscilloscopes use 
this technique with an improved folding pseudorandom 
number generator. Il is used whenever the instrument is in 



the normal display mode. For more information on this sam- 
pling technique, see the article on page 26. 

Peak Detection. Another common data compression technique 
is to store the minimum and maximum values of all of the 
samples in the sample interval, as shown in Fig. 3d. Widely 
used in digital oscilloscopes, this technique is called peak 
detection. When this technique is in use, rare, infrequent 
excursions are never ignored (lost), as they might be in the 
preceding two methods. This approach, however, tends to 
over-emphasize noise on the displayed record. Peaks are 
exaggerated and baselines become fatter at the expense of 
signal details. For example, an AM signal, peak detected, 
would show the modulation envelope quite clearly, but 
would lose the shape of the carrier wave. Statistical infor- 
mation about where the signal spends most of its time 
between the minimum and maximum values is lost. 

The HP 54645A/D oscilloscopes combine traditional peak 
detection with intra-acquisition dithering when the instru- 
ment is in peak detection mode. They simultaneously ac- 
quire and plot both sampled versions of the incoming signal. 
The peak detected version highlights the signal extremes 
(one minimum/maximum pair per pixel column), while the 
denser conventionally sampled record (six points per pixel 
column) provides statistical information about the signal. 

Smoothing. Yet another approach for decimating the incoming 
samples is shown in Fig. 3e. Here, rather than store the signal 
extremes for each interval, the average value is stored. This 
is logically equivalent to a simple low-pass boxcar filter cas- 
caded with a decimating filter. At the slowest sweep speeds, 
millions of samples are averaged together for every point 
drawn on the screen, even on single traces. This display 
mode is useful for pulling the signal out of the noise. 

While Smoothing reduces the noise in the signal in a manner 
similar to conventional averaging, there are some differ- 
ences. Smoothing works on signals acquired with a single 
trigger, while averaging requires multiple acquisitions to be 
effective. Smoothing functions as a low-pass filter with die 
cutoff frequency depending on the time base setting of the 
oscilloscope. When possible, the HP 54645A/D oscilloscopes 
use smoothing decimation when conventional averaging is 
Hinted on to provide additional noise reduction. 

Logic Channel Decimation (Glitch Detection). The acquisition 
system of the IIP 545451) mixed-signal oscilloscope has 
10 logic channels. Decimating the series of Is and 0s for 
these logic channels provides its own challenges. Simple 
decimation teciuuques would lose narrow glitches, so a 
peak detection technique (known as glitch detect ion in the 
logic analyzer domain) is always used. Two bits are suffi- 
cient to encode any sample interval, one value for each state 
(high and low). Not suffering from the drawbacks of peak 
detection on oscilloscope traces, logic waveforms are 
always acquired and plotted using glitch detection encoding 
regardless of the display mode selected for the oscilloscope 
channels. 

Trigger System Features 

Perhaps the most difficult task of the design was the trigger 
architecture for the mixed-signal oscilloscope. It needed to 
be a mixture of both analog anil digital trigger systems. The 
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traditional oscilloscope trigHerinK modes (rising or falling 
edges on any channel, willi a variety of coupling and noise- 
reject selections) needed to be coupled with the logic trig- 
gering modes (pattern triggering, sequencing, Boolean AND/ 
OR. and so on). But more significant, all of the cross-domain 
triggering circuits needed to be defined and developed. 

Development of this cross-domain triggering system archi- 
tecture and choices about which trigger modes to support 
were guided by two key design objectives. The first of 
these was to seamlessly integrate the trigger configuration 
for all channels, whether analog or digital. This allows any 
channel to be used as a trigger source in any trigger setup. 
A second objective was to extend this seamless trigger con- 
figuration to a full set of trigger features. This feature set 
extends far beyond traditional single-channel, edge-mode 
triggering to include functionality essential in a mixed-signal 
test environment. 

In mixed-signal testing, digital signals are often grouped 
together to form a common data or address wortl. Conse- 
quently, pattern triggering using terms including all channels 
was an obvious need. Inclusion of analog channels in patterns 
also provides the unique capability to gale a digital address 
trigger specification with analog-level conditions. Use of a 
pattern-qualified edge term (pattern AND edge) is a second 
key capability, since it allows a typical read or write to be 
captured by specifying the address in the pattern specifica- 
tion and the edge as the read or write strobe. Specification 
of trigger sequences in the mixed-signal oscilloscope is a 
further capability that is especially useful in capturing cause- 
and-effect relationships between analog and digital signals. 
Finally, patient duration and Boolean combinations of trigger 
terms are included. 

Trigger Types 

Design of a cross-domain trigger system to meet these objec- 
tives was influenced by trigger methods specific to timing 
analyzers and digital oscilloscopes. Typically, timing (and 
logic) analyzer's triggering on pattern or complex logical 
terms assume that all trigger events are simultaneous with 
clock transitions (in our case the ADC sample clock). This is 
referred to as synchnmotis trigger genera/ ion. In the analog 
world of the oscilloscope, all trigger events are assumed to 
be asynchronous to clock edges Since each of these methods 
has advantages, the final implementation of the HP 546451) 
trigger hardware uses two independent trigger systems, one 
for synchronous and one for asynchronous triggering. 

Trigger features typical of timing analyzer capability include 
pattern triggering, sequences of pattern terms, pattern dura- 
tion, and logical combinations of pattern and edge terms, 
hi a timing analyzer, digital channels are normally grouped 
together and trigger conditions are evaluated only at sample 
clock intervals. There are several advantages to this type of 
synchronous sample clock triggering. One is that trigger 
events are always visible on the display because events that 
occur between sample clocks cannot cause triggers. Also, 
the signal transition causing the trigger event to occur will 
not display any jitter at the trigger lime because that time 
occurs at a sample clock edge. Furthermore, complex trig- 
ger sequences can be easily implemented in hardware using 
sequential logic. Finally, it is easy to reject spurious patterns 
by qualifying pattern detection over multiple samples. 
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Fig. 4. Effect <>f synchronous triggering on the Waveform display, 

In a mixed-signal oscilloscope, triggering synchronously 
with the sample clock also has some significant drawbacks. 
Above all, it degrades equivalent time sampling operation 
for analog waveforms. This occurs because equivalent lime 
sampling relies on random distribution of trigger events with 
respect to the sample clock and uses multiple triggered 
waveforms to enhance the analog display quality. The analog 
waveform display, when synchronously triggered at fast 
time base speeds, appears as a broken trace, with bunching 
of the signal at sample clock intervals (see Fig. 1). Since 
the IIP 54645D samples at 200 MSa/s. degradation of wave- 
forms is only severe at time base speeds of 20 ns/div or less. 
Furthermore, single-shot trigger captures are unaffected by 
synchronous triggering of the waveform. 

A second drawback of synchronous triggering is that if the 
sample rate is too low, trigger events can be totally missed. 
This is not a problem in the HP 54645D mixed-signal oscillo- 
scope because the sampling interval (5 ns) is smaller than 
the minimum width of any pulse passed by the 100-MIfe 
analog bandwidth ( -7 ns). 

Asynchronous trigger generation, traditional in digital oscillo- 
scopes, provides precise trigger timing information to the 
waveform display. This allows the use of equivalent time 
waveform reconstruction to combine multiple trigger events 
and optimize signal display for repetitive signals al fast lime 
base speeds. In addition, trigger lime resolution for single- 
shot captures is limited only by analog channel bandwidth, 
rather than by sample time granularity, However, asynchro- 
nous trigger generation does have one significant drawback. 
It is possible thai the event thai generates the trigger may 
not be visible in the waveform display. This possibility exists 
because an analog event may cross a trigger threshold and 
then fall below the threshold again between successive sam- 
ple clocks. In the IIP 546451). Ibis effect is minimized because 
the analog bandwidth of the trigger path limits detectable 
input pulse widths to about 7 ns (depending on threshold 
level) while the minimum sampling interval is 5 ns. 

Mixed-Signal Oscilloscope Trigger System 
Architecture 

To achieve optimal signal display quality and maximize trig- 
ger capabilities, the HP 54645D mixed-signal oscilloscope 
uses both synchronous and asynchronous triggering. The 
hardware architecture, shown in Fig. 5, includes dual trigger 
paths that implement tltis trigger capability. The synchronous 
trigger path includes a full set of pattern and edge detectors. 
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Fig. 5. HP 54645D trigger system block diagram. 



which allow independent detection of all trigger terms. A pat- 
tern duration block is also included to allow triggers based 
on pattern duration less than target time, greater than target 
time, and in a target time range. An important feature of this 
path is the inclusion of high-frequency pattern rejection 
blocks. These blocks require that patterns be present for a 
fixed number of sample periods. This enables rejection of 
patterns that are only present at bus transitions because of 
channel-to-channel skew. An example of this skew in shown 
in Fig. 6, where a transition is made between address values 
of D7 and A4 (hexadecimal). Since the sample clock is not 
synchronized with the address transition, the sample clock 
will occasionally occur during transition states (in this case 
the address B6 ) and trigger on these patterns unexpectedly. 
With the high-frequency reject enabled, these triggers are 
rejected because they are only present for very short times 
(typically less than 1 ns). The pattern rejection block is auto- 
matically programmed by the CPU to require a minimum of 
two samples for all pattern definitions with more than one 
defined level. Outputs from the trigger terms detected are 
then routed through combinatorial logic blocks, sequencing, 
and holdoff. The design of this path allows for a completely 
orthogonal set of trigger logic terms. 

The asynchronous trigger path provides similar trigger func- 
tionality, including edge trigger, pattern detection, pattern- 
qualified edge detection, pattern duration, and Boolean logic 
combinations of trigger terms. Since this path lacks multiple 
edge detection blocks, only single-edge trigger terms can be 
configured. Also, only one pattern-qualified edge detection 
can be configured, The ability to do cross-domain triggering 
with synchronously configured pattern terms arming the 
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asynchronous trigger path significantly broadens the config- 
uration choices. For example, an advanced pattern definition 
of (PI AND El) AND (NOT = P2I can be implemented by synchro- 
nously detecting PI AND (NOT = P2) and using this to arm the 
asynchronous block, which then detects El (see Fig. 7). 

Based on the current trigger term definitions and channel 
display selections, the mixed-signal oscilloscope CPU con- 
figures either the synchronous or the asynchronous trigger 
path as the trigger source for the instrument. To optimize 
the analog channel display quality, triggers are set up asynch- 
ronously, if possible, for all cases in which analog channels 
are on or analog edges are used in trigger terms. This con- 
text-sensitive implementation allows the use of equivalent 
time sampling to enhance analog waveform display in all 
possible cases. Consequently, waveform display is optimized 
without the need for the user to know the details of the 
trigger configuration. 

Mixed-Signal Oscilloscope Trigger Applications 

Typical trigger applications for the HP 54645D mixed-signal 
oscilloscope use the rich set of triggering functions to capture 
events combining analog and digital criteria. The following 
paragraphs present specific examples that, show the unique 
capabilities of the mixed-signal oscilloscope trigger system. 

ADC Performance Evaluation. Dynamic ADC performance can 
be evaluated using either pattern triggering to trigger on 
specific digital outputs or edge triggering with the analog 
threshold set to the level of interest. By providing an input 
sawtooth waveform, slew rate effects, missing codes, and 
nonlinearities can easily be evaluated. 

Gated Processor Address Read/Write. An edge-qualified pattern 
detection can be used to detect a processor read or write to 
a specific address. By using analog channels in the pattern 
definition, it is possible to gate the trigger so that only pro- 
cessor reads and writes are displayed that occur when an 
analog signal is in a specific state. For example, in an alarm 
system in which a processor normally polls critical sensors 
to determine if alarm conditions exist, the use of a gated 
pattern and edge trigger could allow a display of all proces- 
sor activity starting with (triggering on) the first processor 
read for which the sensor state indicates an alarm. 



Glitch/Pattern Duration. Glitch triggering allows triggering on 
the duration between the positive and negative edges of a 
single signal. This allows capture of pulses less than a spe- 
cific pulse width (glitches), greater than a specific pulse 
width, and within a specific pulse width range. This is very 
useful for capturing events that are in or out of specific tim- 
ing requirements. Pattern duration allows the same duration- 
triggering choices but combines up to eighteen channels in 
the pattern definition so that triggering on the timing of 
combined channels is obtained. A specific example of using 
pattern duration is in capturing timing violations for a data 
handshake (see Fig. 8). The time for data acknowledgment 
after data valid must be less than a maximum time T. Viola- 
tion of this timing could be found by setting pattern terms 
DAV low and DTACK high and looking for pattern duration 
greater than T. 

Interrupt Response/Latency. Using a trigger sequence such as 
El THEN P1 AND E2 allows triggering on a processor response 
to an interrupt by first triggering on a hardware interrupt to 
a processor and then on a write to a register that is the ulti- 
mate response of the processor to the interrupt. The value 
written out can be read and subsequent events can be traced. 
Since the mixed-signal oscilloscope captures negative-time 
(pretrigger) events, this trigger can also be used to measure 
interrupt processing latency. 

Register Write/Analog Response. In control applications, it is 
useful to be able to trigger on an analog signal that is con- 
trolled by the setting of a control register. This can also be 
achieved by a trigger sequence such as pattern AND edge 
THEN edge. For example in a motor control application, this 
trigger sequence can be used to trigger on a write to a control 
register followed by a motor commutation. 

Display Enhancements 

In the HP 546xx family of products, waveform update and 
display have always been high priorities in the design. Tradi- 
tionally, digital oscilloscopes have had a slow update rate of 
data to the screen. In the HP 54645A/D the update rate has 
been increased to approximately three million points per 
second. This gives a much closer representation of the way 
an analog oscilloscope displays waveforms. For instance. 
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Fig. 8. Pattern duration triggering to find a timing violation. 

on an amplitude modulated signal, an analog display would 
show the upper and lower boundaries with a filled-in area 
between them. On a traditional digital oscilloscope you 
would only see a few waveforms inside those boundaries. 
However, because of the increased update rate on the HP 
546xx family, you see the boundaries and the area inside 
filled with data, a display that matches the traditional analog 
oscilloscope much more closely. This helps by displaying 
waveforms more closely to their true nature, but by itself 
does not address other features of an analog display such 
as its inherent intensity variation as a function of slew rate. 

Analog oscilloscopes generally use a cathode-ray tube (CRT) 
in which an electron beam is swept across a display ai a 
fixed speed while being deflected in the vertical direction by 
an analog signal. Because of this, areas of the waveform that 
have a high slew rate, such as edges, are displayed at a lower 
intensity than areas such as the top of a waveform, where 
the slew rate is low. This intensity variation provides infor- 
mation to the user that is not available with a traditional 
digital oscilloscope. For example, square waves displayed 
on an analog display are brighter ai the top and bottom where 
the slew rate is low, and quickly changing edges are at a 
much lower inteasity. For another example, consider a wave- 
form that has a lot of baseline noise. On an analog display, 
the persistence of the phosphor causes the baseline to be 
very bright and infrequent noise to be dim. On a digital dis- 
play, the noise is accentuated because all of the data points 
are displayed with the same relative intensity. This presents 
the digital oscilloscope designer using a raster display with a 
real challenge. How to display the waveforms to give a more 
accurate representation of their time-varying nature? 

t me thing to note is that the analog oscilloscope doesn't 
display infrequent signals very well. Because the phosphor 
has a limited persistence, the waveform will fade after a short 
time and if not redrawn will not be visible. One attempt to 
solve this problem is the analog storage oscilloscope, which 
uses a special plate in the CRT that has a much longer per- 
sistence. This is not a perfect solution because die wave- 
form tends to bloom on this type of display and will still lade 
over time. It is also fairly expensive. A solution in the digital 
realm has been to use color, displaying data points that are 
sampled frequently in a bright color that is dimmed at a 
Computed fade rate to simulate a long-persistence phosphor. 
However, color displays are more expensive, and more 



memory is needed to keep the counts for each data point- 
This can also be done with a monochrome display by varying 
the brightness of a data point in time, but more memory is 
still required, and in both cases an increased computational 
burden is placed on the display softw are. None of these 
methods addresses the problem of displaying different slew 
rates at different intensities. 

The HP 54645A/D oscilloscopes use a proprietary variable- 
intensity algorithm to control the intensity of each data 
point. Adjacent points in the same raster row are used to 
set the intensity of the current point. Therefore, a point with 
horizontal neighbors will be brighter than if it has none. In 
the HP 54645A/D oscilloscopes, two intensity levels are used: 
full-bright and half-bright. When a point has a neighbor on 
both sides it is displayed full-bright. When it has zero or one 
neighbor, it is displayed half-bright. Thus, a high-slew-rate 
edge is displayed with less intensity because it has no hori- 
zontal neighbors. Looking again at the square wave, the 
edges will be displayed at a lower intensity than the top and 
bottom of the signal, which results in a display that looks 
much more like the analog oscilloscope's representation of 
the waveform. For the signal with a noisy baseline, the noise 
away from the baseline is displayed at a lower intensity 
because the data points do not have horizontal neighbors. 
The cost to implement this enhancement is greatly reduced 
because inexpensive printed circuit board hardware is used. 
The update rate of the waveforms is not affected because 
this method can be used on die fly as data points are being 
displayed. 

One other enhancement is used when die HP 54645D displays 
logic waveforms. These traditionally have a high value and a 
low value with edges drawn between them when the wave- 
form changes. On a raster display, we have a fixed horizontal 
resolution in which to place the data points. The display is 
divided into rows and columns of addressable points called 
pixels. Suppose a point falls at the boundary between two 
horizontal pixels. On some acquisitions it will be displayed 
in one pixel, on others in the adjacent pixel. This results in a 
double-wide full-bright edge. Compared with a point that 
falls in the middle of a pixel, it will seem brighter. On a 
display with many logic waveforms (his adds a perceptible 
flicker. The HP 54645D solves this by displaying double-wide 
pixels half-bright and single-wide pixels full-bright. This has 
(he effect of equalizing the intensity and greatly reducing the 
perceived flicker. 

In all of these methods an attempt was made to give the HP 
54645A/D oscilloscopes a display that does its best to repre- 
sent the waveforms the user might encounter. 
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A Cost-Effective Architecture for a 500-MHz Counter for Glitch Trigger 



The HP 54B45A and 54645D oscilloscopes can trigger on pulses qualified 
by their width, a feature known as glitch trigger. The time qualification is 
done with a 500-MHz, 37-bit counter that provides 2-ns timing resolution 
over an 8-ns-to-100-s range. Operation at 500 MHz (a 2-ns period) is too 
fast for CMOS logic, but the 37-bit counter is too large for economical 
integration in bipolar logic. The clock for the counter must be asynchro- 
nous with the analog-to-digital sampling clock so that the time at which 
a trigger is generated is in no way related to the times of the samples. 
Otherwise, in random repetitive acquisition mode (also known as equiva- 
lent time sampling), the samples would not be randomly distributed with 
respect to the input signal and the oscilloscope waveform would contain 
missing segments. 

The block diagram in Fig. 1 shows how these requirements and constraints 
were satisfied. The least expensive way to obtain the 500-MHz glitch 
trigger clock was with a commercial single-chip phase-locked loop 
locked to the 50-MHz reference clock. The phase-locked loop synthesizes 
a 250-MHz clock, which is doubled to 500 MHz in the bipolar gate array. 
Phase locking to the reference clock guarantees that the glitch trigger 
clock is precisely synchronized with the reference. Since the sample 
clock is also locked to the reference but phase dithered, the glitch and 
sample clocks have random phase with respect to each other, preventing 
missing segments in the waveform. 

The clock doubler generates an active 500-MHz edge on both the rising 
and falling edges of the 250-MHz clock. The symmetry of the 250-MHz 
clock affects the uniformity of the periods of the 500-MHz clock. The 
jitter of the 250-MHz clock causes period jitter in the 500-MHz clock. 
The delay element t, formed with gate delays, determines the duty cycle 
of the 500-MHz clock. The asymmetry and period jitter errors of the 
500-MHz clock must both be less than 100 ps to preserve timing margins 
in the 500-MHz counter. The phase-locked loop IC alone cannot meet 
these requirements, so a bandpass filter is introduced that removes 
spurious components in the clock spectrum, improving the symmetry 
and reducing the jitter from 111 ps to 44 ps. 1 



For reasons of both power and cost, the 37-bit counter is too large to 
implement entirely in the bipolar gate array. By including only a few of 
the low-order, high-speed bits in the bipolar IC and the remaining bits in 
a CMOS gate array we are able to build a very economical, low-power 
500-MHz counter. The bipolar portion is called the fast counter and the 
CMOS portion the slow counter. Together these two operate as a single 
37-bit synchronous counter. 

The design of the counter is complicated because the CMOS IC must never 
see pulses narrower than about 10 ns and responds relatively slowly with 
the terminal count (TC) signal To overcome these limitations the fast 
counter operates in either of two modes: as a five-bit counter or as a four- 
bit counter. For counts from 1 to 31 the fast counter performs all the count- 
ing. For counts of 32 and up, the bipolar counter is always loaded with a 
count between 15 and 31 Since the fast counter is loaded with a mini- 
mum count of 15, the slow counter always has at least 15 x 2 ns or 30 ns 
to reload itself before it is called upon to begin counting. The fast counter 
counts down in five-bit mode to zero and then generates a SLOWCLK 
signal, triggering a count in the slow counter. It then switches to 4-bit 
mode and counts down, generating a SLOWCLK signal every 15 clocks 
Thus, the slow CMOS counter is clocked at only 500 MHz/15 or 33 MHz 
and has 30 ns in which to generate SLOWTC. These requirements are 
easily met with modern CMOS gate array technology. 
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Sustained Sample Rate in Digital 
Oscilloscopes 

At all but a few of the fastest sweep speeds, the acquisition memory 
depth and not the maximum sample rate determines the oscilloscope's 
actual sample rate. Peak detection capability, when used correctly, can 
make up for acquisition memory shortfalls. 

by Steven B. Warntjes 



One of the most basic specifications in digital oscilloscopes 
is the maximum sample rate. Oscilloscope users often under- 
stand I he theory of signal sampling and signal reproduction, 
bul often mistakenly assume that the oscilloscope always 
samples al its maximum sample rate. In reality. I wo specifi- 
cations need to be considered: the maximum sample rate and 
the acquisition memory behind Ihe signal sampler. Ai all bin 
a few of the fastest sweep speeds, the acquisition memory 
depth and not the maximum sample rale determines the 
oscilloscope's actual sample rate and consequently how ac- 
curately the input signal is represented on the oscilloscope 
display. The deeper the acquisition memory, the longer the 
oscilloscope can sustain a high sampling frequency on slow 
time-per-division sel lings, thus increasing I he actual sample 
rate of Ihe oscilloscope and improving how the input signal 
looks on the oscilloscope screen. 

The digital oscilloscope's peak detection specification is an- 
other often overlooked specification. This important feature, 
when used correctly, can make up for acquisition memory 
shortfalls. In addition, peak delect ion can be combined with 
deep acquisition memory (0 create unique advantages in 
digital oscilloscopes. 

Digital Oscilloscope Sampling 

Basic sampling theory states that for a signal to be faithfully 
reproduced in sampled form, the sample rate must be greater 
than twice the highest frequency present in the signal. This 
sample rate is known as the Nyquist rate. 1 For an oscillo- 
scope, this means that if the user wants lo capture a 100-MHz 
signal, the sample rate should be at least 200 MSa/s. While the 
theory states that greater than 2 x sampling is sufficient, the 
practicality is that lo reproduce the input signal with 2 x sam- 
pling requires complex mathematical functions performed on 
many samples. Rprmislriirtinii is the term commonly given to 
the complex mathematical functions performed on the sam- 
pled data to Interpolate points between Ihe samples. In prae- 
lice, i iscilloscopcs that do reconstruction typically have less 
than perfectly reproduced waveforms because of imperfect 
application of the mathematical theory.* and they may have 
a slow waveform update rate because of the time it lakes to 
do the necessary compulations. 

" Etiors are attributable to liifjh-trequency components in iho captured siflnal and a tinite 
number ol samples 



One solution to the reconstruction problem is to sample the 
waveform at a rate much higher then the Nyquist rate. In the 
limit as sampling becomes continuous, as it is in an analog 
oscilloscope, the waveform looks "right" and requires no 
reconstruction. Consequently, if the digital oscilloscope can 
sample fast enough, reconstruction is not necessary. In prac- 
tice, a digital oscilloscope rule of thumb is that 10 x over- 
sampling Ls probably sufficient not to require reconstruction. 
This rule dictates that a digital oscilloscope with 100 MHz 
of bandwidth would require an analog-to-digital convener 
( ADC ) to sample the input signal at 1 GSa/s. In addition, the 
acquisition memory to hold the samples would have lo 
accept a sample every nanosecond from Ihe ADC. While 
memories and ADCs in this frequency range are available, 
they are typically expensive. To keep oscilloscope prices 
reasonable, oscilloscope manufacturers minimize the mnounl 
of fast memory in their oscilloscopes. This cost minimization 
has the side effect of dramatically lowering the real sample 
rate at all bul a few of the fastest time-per-division settings. 

The sampling theory presented above assumes that the 
sampler only gets one chance to "look" at the input signal. 
Another solution to the reconstruction problem is to use 
repetitive sampling. In repetitive sampling, the waveform 
is assumed to be periodic and a few samples are captured 
each time the oscilloscope sees the waveform. This allows 
Ihe oscilloscope l<> use slower, less expensive memories and 
analog-to-digital conveners while still maintaining very high 
effective sample rates across multiple oscilloscope acquisi- 
tions. The obvious disadvantage is that if the waveform 
is not truly repetitive the oscilloscope waveform may be 
inaccurately displayed. 2 

Digital Oscilloscope Memory Depth 

Now that we understand digital oscilloscope sampling re- 
quirements, we can examine how memory depth in oscillo- 
scopes affects the sample rale at a given sweep speed or 
time-per-division setting. 'Hie lime window within which a 
tligital Oscilloscope can capture a signal is Ihe product of Ihe 
sample period and Ihe length of the acquisition memory. The 
acquisition memory is measured by the number of analog-to- 
digital converter samples it can hold. Typically in digital 
oscilloscopes the length of Ihe acquisition memory is fixed, 
so to capture a longer time window (adjust the oscilloscope 
for a slower time-per-division setting) the Sample rale must 
be reduced: 
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Fig. l. Effect of memory depth on the maximum $ample rate 
ofa digital oscilloscope. Oscilloscope A has a memory rk'jjtli of 
1110(1 samples. Oscilloscope B hfLs a l.OOO.OOO-sainple Mieniory. 

tcAITL'RE = k.MEMi )UVTs. 

where tcAPTDRE is time window captured liy the oscillo- 
scope, k^]g\n )i;y is the length of the acquisition memory, and 
f s is the sample rate. 

In Fig. 1 we have oscilloscope A with a maximum sample 
rati' Of 200 MSa/s and an acquisition memory depth Of IK 
samples. Oscilloscope A samples ai its maximum sample 
rate specification of 200 MSa/s or 5 nanoseconds pet sample 
from 5 ns/div to 500 ns/div. As the time-per-di\ision increases 
beyond 500 ns/div we see that the sample rate drops. At 
I millisecond per division the oscilloscope's sample rale has 
dropped to 10 us per sample or 100 kSa/s. This dramatic 
drop in sample rate at slower sweep speeds leads to oscillo- 
scope displays of fast signals that don't accurately represent 
the input signal. 

Fig. 1 also shows the sample rate behavior of oscilloscope B. 
which differs from oscilloscope A only in that the memory 
depth has been increased from 1.000 samples to 1.000.000 
samples. Oscilloscope B samples at its maximum sample 
rate of 200 MSa/s from 5 ns/div to 500 us/ciiv. On fast time 
base settings, oscilloscopes A and B both sample at 200 MSa/s. 
An important difference shows up at the slower time base 
settings. At I millisecond per division we see that oscillo- 
scope B samples at 100 MSa/s or 10 nanoseconds per sample 
while oscilloscope A lias dropped to 100 kSa/s. We can clearly 
see that the deeper memory ( x 1000 ) gives us the ability to 
capture events 1000 times faster at the slower time-per- 
division settings. The effect of deeper memory is the ability 
to sustain the maximum sample rate at much slower sweep 
speeds, in our example from 500 ns/div to 500 us/div. and 
sustain faster sample rates at sweep Speeds where the sample 
rate must be reduced because of memory depth limitations. 
This effect of increased acquisition memory gives the oscillo- 
scope higher sampling performance over a wider range of 
oscilloscope operation. 

Digital Oscilloscope Peak Detection 

Peak detection is a powerful feature that compensates for 
lack of memory' depth in digital oscilloscopes, hi peak detec- 
tion mode the oscilloscope looks at many samples but stores 
only the maximum and minimum v oltage levels for a given 
number of samples. The effective sample rate of the oscillo- 
scope in peak detection is normally the maximum sample 



rate of the oscilloscope on all sweep speeds * This enables 
the oscilloscope to capture fast input signal amplitude varia- 
tions by sampling at the maximum sample rate, yet save 
memory space by storing only the significant signal devi- 
ations. The value of peak detection is best illustrated with a 
simple example. In Fig. 2 we have a display ofa long time 
window with short-duration changes in signal amplitude. We 
are at a time-per-division setting of 20 seconds per division 
and are showing a 100-nanosecond event. To detect this event 
without peak detection would require at least a 1/(100 tls) = 
10-MSa/s sample rate. Likewise, to capture 200 seconds 
(20 s/div x 10 div) of information with 100-ns sampling 
would require a memory depth of 200/(100 ns) or two billion 
samples. Clearly, peak detection is useful for catching short- 
duration signal changes over long periods of time without 
large amounts of oscilloscope acquisition memory. In many 
instances peak detection can make up for acquisition memory 
slum falls. 

Traditional digital oscilloscope peak detection has I wo sig- 
nificant disadvantages. Firs! is the lack of liming resolution 
of the stored peak detected samples. As described above, the 
net peak detection sample rate is the maximum oscilloscope 
sample rate divided by the number of samples searched for 
minimums and maximums. This leads to a lack of timing 
resolution because the oscilloscope only stores maximum 
and minimum values. This 1-out-of-n selection results in a 
loss of information as to whether the maximum occurred on 
the first, last, or mill sample of the samples searched for the 
maximum and minimum voltage values. In Fig. 3 we have 
taken our Fig. 2 waveform and expanded the lime scale to 
display the maximum value. (The maximum value is shown 
as a solid bar. the available timing resolution. ) 

The second peak detection disadvantage is the apparent 
noise on the oscilloscope display. The storage of only maxi- 
mum and minimum voltage levels has the effect of making 
the input waveform appeal - to contain more noise or signal 
aberrations then are actually present. This is because the 
peak detection algorithm stores only these peak values, and 
not normal signal voltage levels. Traditional digital oscillo- 
scope peak detection gives the user a biased view of the 



Some oscilloscopes wilh digital peak detection rio nol use even/ possible sample in peak 
detection mode 
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Fig. 2. A display of a long tune w indow in peak detection mode, 
showing a I00-uai\osccnnd event al a tiine-pcr-divisioii setting of 
20 seconds per division. To detect this event without peak detec- 
tion would require at least a lU-MSa/s sample rate and a memory 
depth of two billion samples. 
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Fig. 3. The waveform of Fig 2 with the time scale expanded to 
display the maximuni value. The maximuni value is shown as a 
solid bar. the available timing resolulion Tin- IOO-ns pulse cap- 
tured across 200 seconds shows only approximately 1 ms uf timing 
resolution. 

input signal by overemphasizing the signal's infrequent 
amplitude deviations. 

Having considered traditional peak detection advantages 
and disadvantages, what do deep acquisition memory and 
sustained sample rate contribute to the peak detection prob- 
lem? Deep memory has three benefits when applied to peak 
detection. First is thai peak detection is needed less fre- 
quently. Deep acquisition memory allows the analog-lo-digiial 
converter to sustain a higher sample rate al all sweep speeds. 
This means that the user is required to switch into peak 
detection less often to catch short-duration signal variations 
reliably. The second advantage is increased tinting resolution 
of the peak detected samples, since the benefits of memory 
depth also apply to the peak delected .samples. The deep- 
memory oscilloscope can si ore many more minimum and 
maximuni pairs over a given time period, yielding a shorter 
lime Interval between the peak detected samples. The third 
deep memory peak del eel ion advantage consists of ihe addi- 
tion of regular (not peak detected) data displayed with Ihe 



traditional peak detected information. In this case the acqui- 
sition memory is segmented and both peak detected samples 
and regular samples are stored. The HP 54645 deep-memory 
peak detection display contains regular and peak detected 
samples. This has ihe effect of deemphasizing the peak 
detected samples since they appear as only a subset of the 
waveform display. While the displayed waveform still differ- 
entiates signal amplitude variations by always displaying 
peaks, it also gives the user a feel for the signal baseline by 
displaying normal waveform information. 

Summary 

Conventional digital oscilloscopes achieve their maximum 
sample rate only on their fastest sweep speeds. We have 
examined two key features (memory' depth and peak detec- 
tion ) that sustain this sample rate over a wider range of 
oscilloscope operation. Increased amounts of acquisition 
memory afford the user a much higher sample rate across 
sweep speeds that are nut at the oscilloscope's maximum 
sample rale and sustain the maximum sample rate of the 
oscilloscope to a larger number of time-per-division settings. 
This produces a more accurate waveform representation of 
the Input signal on ihe digital oscilloscope display across 
many sweep speed settings. We have also described the 
advantages and disadvantages of traditional digital oscillo- 
scope peak detection and pointed out some unique sustained 
sample rate and deep-memory peak detection advantages. 
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Acquisition Clock Dithering in a 
Digital Oscilloscope 



When a frequency component of the input signal is greater than half the 
sample rate, aliasing can occur. When the oscilloscope is equivalent time 
sampling, signals that are subharmonics of the sample clock will be poorly 
displayed. In the HP 54645A/D oscilloscopes, these effects are greatly 
reduced by dithering the sample clock during and between acquisitions. 



by Derek E. Toeppen 



A common concern of digital oscilloscope users is that there 
are combinations of oscilloscope settings and input signals 
that will cause I he Standard digital oscilloscope architecture 
to display a signal poorly or incorrectly. Since an oscillo- 
scope is a device intended CO display a variety of signals, 
sooner or later one of these combinations of settings and 
signals will be encountered, leaving the user confused and 
with diminished confidence in the instrument. 

The classic case occurs when the sample rate and input 
signal violate the Nyquist sampling theorem, or specifically, 
when a frequency component of the input signal is greater 
t han half the sample rate. When this happens, an aliased 
waveform will be displayed. A more subtle case occurs 



when the digital oscilloscope is random repetitive sampling 
(also known as equivalent time sampling).' In this case, sig- 
nals that are subharmonics of the sample clock will be poor- 
ly displayed. This occurs because the repetitive samples are 
not randomly distributed over the input signal, but rather 
are bunched together. 

There are ways of designing digital oscilloscopes that greatly 
reduce these effects. These techniques involve dithering I he 
sample clock during and between acquisitions (intra-acquisi- 
tion and interacquisition dithering). Two such techniques 
used in the design of the 54645A/D oscilloscopes will be 
discussed here. 
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Fig. 1. The development or a waveform during random repetitive sampling, ( a) The result or one acquisition of an input signal, 
(b I The same signal affpr three acquisitions (c) After seven acquisitions, (d) After 20 acquisitions. 
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InteracquLsition Dithering 

The underlying principle of random repetitive sampling is 
that there is no phase correlation between the sample clock 
and the signal being sampled. This principle ensures that 
samples taken of the signal are randomly distributed over 
the signal, and when accumulated over time, will develop a 
detailed reconstruction of the input signal. The accumulation 
of samples is illustrated in Fig. L Fig. la shows the initial set 
Of samples acquired during the first acquisition of the signal. 
The sample rate and input frequency meet the Nyquist crite- 
rion but that there are. nonetheless, large gaps between 
samples. Fig. lb shows the same signal after accumulating 
three acquisitions. Note the random placement of the sec- 
ond and third sets of samples relative to the first, set. The 
shape of the signal is becoming more defined. Fig. lc is after 
seven acquisitions and Fig. Id after 20. In Fig. Id. the signal 
shape is well-defined and accurately represented. 

The typical digital oscilloscope relies on the fact that the 
sample clock (l" s ) inside the oscilloscope is derived from a 
clock separate and independent from the signal being mea- 
sured (fj) to satisfy the criterion that there is no phase cor- 
relation between the two signals. However, nothing prevents 
the oscilloscope user from applying a signal that is the same 
frequency as the sample clock or subharmonically related 
to it ('fj = t/n ). When this happens, the sample points are no 
longer randomly distributed across t he input waveform as in 
Fig. I. This case is illustrated in Fig. 2. In Fig. 2. a sine wave 
with a frequency equal to precisely 1/10 the sample clock 
(fj = f s /10) is applied to a random repetitive sampling digital 
oscilloscope. The result is that all sets of samples accumu- 
late around the same locations, creating the bunching effect 
mentioned before. Since the acquisition clock and the Input 
sine wave are not phase-locked to each other, if the wave- 
form is allowed to accumulate long enough, the points will 
spread out, but the time it takes to do this will depend on 
the stability of the two sources. 

If the sample set acquired during the lirsl acquisition in Fig. 2 
is taken as a reference, then what is needed is a way to shift 
Subsequent sets of samples so that they fall in between the 
points Of the first set (as they did in Fig. 1 ), or, viewed in 
terms of the phase of the sample clock, to spread subsequent 
sets over the 300 degrees of phase between I he initial sample 
points. This can be achieved by shifting the phase of the 
acquisition clock after each acquisition 
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Fig. 3. (;u Phase-Inched loop. HO Phase error voltage injected 
into the loop. 

A circuit ideally suited for this task is the phase-locked loop. 
The basic phase-locked loop is illustrated in Fig. 3a. In this 
circuit, the phase of the reference cloc k is compared to the 
phase of the Output of the voltage-controlled oscillator (VCO). 
The loop filter drives the VCO input to cause die phase dif- 
ference between the reference clock and the VCO output to 
be zero. If an error voltage is injected into the filter at the 
output of the phase comparator, as in Fig. 3b, then the error 
Mill age appears as a phase error to the loop. The loop will 
adjust the VCO to create a real phase error to cancel the 
injected error voltage, thereby generating a nonzero phase 
difference between the reference clock and the VCO output 
If the VC( ) output is used as an acquisition clock, then Ihe 
error voltage node can be used to create phase shifts in the 
acquisition clock. 

A block diagram of Ihe phase-locked loop circuit used in Ihe 
IIP54(545A/D to produce this type of intoracqtiisition clock 
dither is shown ill Fig. 4. In this circuit, the error voltage is 
generated by a .H-hil digilal-lo-analog converter (I)AC). This 
provides 2 ,H or 2o(i discrete phase shifts in Ihe acquisition 
clock. The 8-bit digital word written to the I)A< ' is pseudo- 
randomjy generated by one of the oscilloscope's processors to 
be consistent with Ihe nature of random repel ilive sampling. 
Fig. ") illust rales Ihe effectiveness of this technique. It is a 
plot of the sine wave of Fig. 2 using phase-loekod-loop- 
gencralcd interacquisition dilher of Ihe acquisition clock. 
All bunching Of the sample points has been eliminated. 
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Fig. 2. The result of applying a sine wave with a frequency equal 
lo precisely 1/10 the Sample clock frequency In a random repetitive 

sampling digital oscilloscope, mi sets of samples accumulate 
around the same locutions. 



Ultra-Acquisition Dithering 

Real-tune sampling is generally considered lo be Ihe solulion 
for aliasing in a digital oscilloscope. The reasoning is that if 
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fig. 4. Method of introducing acquisition 'lock dither En the 
IIP r>4iM- r >A/I> oscilloscope phase-locked loop to prevent Hit* 
bunching effect shown in Pig. 2. 
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Fig. 5. Acquiring the signal of Fin. - wiili uiteracquisitiori dither 
eliminates the hunching effect 

ill* - oscilloscopes sampling rale is always al least twice its 
bandwidth then aliasing cannot occur. However, what is 
commonly overlooked is that to overcome finite memory 
limitations, even real-time sampling oscilloscopes are forced 
to redtice their sample rales at some point to capture long 
lime periods (see article, page 23). When this occurs, the 
oscilloscope becomes susceptible in aliasing. 

A common technique for decreasing the sample rale in a 
digital oscilloscope is illustrated in the block diagram in 
Fig. (i. In this circuit, the digitizer always samples at the 
maximum rate but not all digitized points are stored into 
memory. The decimator between the digitizer and memory 
handles the selection of the desired digilized points. To re- 
duce the sample rate by the factor N, the decimator passes 
only every Nth point on to the memory. All oilier digilized 
points are lost This technique can be modified slightly in 
a way that will greatly reduce the likelihood of an aliased 
waveform developing. 

To understand Ibis, first consider how an aliased waveform 
is generated. If a sine wave with frequency f, is sampled with 
frequency f v where f, > f/2, then an aliased sine wave of 
Frequency equal to lfj-f s l will occur. For example, suppose 
that a 1.01-MHz sine wave is applied to a digital oscilloscope 
sampling at 1 MSa/s. An aliased frequency (or beat frequency) 
of 10 kHz will result. What is seen on screen is what appears 
to be an untriggered 10-kIIz sine wave. 

This phenomenon repeats itself at each harmonic of the 
sample dock, so if a frequency of 10.01 MHz, near the tenth 
harmonic of the sample clock, is input to the same oscillo- 
scope, the same aliased 10-kIIz signal is produced. However, 
between the samples taken at 1-us intervals (1/1 MSa/s). ten 
cycles of the input sine wave occur. When the decimal or 
process shown in Fig. (i is used to reduce the sample rate, 
samples of those cycles are thrown away. It is these dis- 
carded sample points that can lie used to prevent the aliased 
waveform. 
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Fig. 6. Decreasing the sample rate by decimation. Thp digitizer 
always samples at the maximum rate hut not all samples are stored 
into memory. 
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Fig. 7. (a) An aliased Hl-kHz sine wave produced when a ln.dl-MHz 
sine wave Is sampled al I MHz. (b) The same display using intra- 
acquisilion dithering (random decimal ion) is a fuzzy band much like 
whal would be seem on an analog oscilloscope, vviih all signs of an 
aliased waveform removed. 

Instead of Storing every Nth digitized point, the decimator 
can be designed to randomly select one out of every N points 
for storage. In the case of the 10.01-MHz input, the points 
placed in memory are points randomly selected from the ten 
cycles of the input that occur in every 1-iis Interval. This 
random sample selection technique effectively dithers the 
acquisition cloc k during the acquisition and prevents a beat 
frequency from developing. 

This intra -acquisition dithering technique has been used 
throughout 'he IIP54GXX oscilloscope product line and 
again in the HP54645A/D products. The effect it has on alias- 
ing is dramatic Fig. 7a shows the aliased 10-kHz sine wave 
that is produced when a 10.01-MHz sine wave Is sampled al 
1 MSa/s. Fig. 71) shows the same display using the dithering 
process just described. The resulting display is a fuzzy band 
much like whal would be seem on an analog oscilloscope, 
with all signs of an aliased waveform removed. 
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An Oscilloscope-Like Logic Timing 
Analyzer 

Market research indicated that some customers doing embedded 
development preferred to work with oscilloscopes instead of standard 
logic analyzers. The HP 54620 logic timing analyzer offers many 
oscilloscope features, including direct-access controls, a highly interactive 
display, computed measurements, delayed sweep, simplified triggering, 
and a trace labelmaker 

by Steven B. Wamtjes 



The principal contributions of the HP 54620 logic timing 
analyzer ( Fig. 1 ) are not in its performance specifications 
but instead in its user interface, optimized feature set. and 
price. These are significant contributions, as evidenced by 
the MP 54620's receiving the Vest & Measurement World 
Best in Test Award for 1995. Market research for the HP 
54620 indicated thai some customers doing embedded devel- 
opment work were frustrated with the COS| and complexity 
of standard logic analyzers and preferred instead to work 
with oscilloscopes. The IIP 54620 bridges the gap between 
standard logic analyzers and oscilloscopes by providing the 
functionality of a timing analyzer and the usability advantages 
of an analog oscilloscope. 



Product Description 

The HP 54620 Ls best described as 16-channel timing analyzer. 
Each timing analyzer channel has one bit of vertical resolu- 
tion indicating either a logic high level or a logic low level 
based on a user-selectable voltage threshold. The HP 54620 
leverages IIP second-generation logie-analyzer-on-a-chip 
technology to provide 500-MSa/s or 2-ns timing analysis on 
all 16 channels simultaneously. This acquisition engine and a 
custom display processor produce oscilloscope-like, high- 
Ihroughpui displays. Monochrome (HP54620A) and color 
(HP 54620C) display versions are available. 




Fig. 1. The IIP 54620 I'i-cliannel 
logic liming analyzer bridges the 
gap Ik-I ween standard logic 
analyzers and oscilloscopes by 
providing t he functionality of a 
timing analyzer an. I the usability 
advantages of an analog oscillo- 
scope. 
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The IIP 54620 leverages the CPU system and the mechanical 
design of the HP 54(500 Series oscilloscopes lor front-panel 
responsiveness and small form factor. The IIP 54620 is posi- 
tioned as a companion to the users oscilloscope and there- 
fore has the ability to be triggered from another instrument 
( (rigger in) and lo trigger another instrument (trigger out). 

User Interface Oscilloscope Similarities 

Most oscilloscope users will tell you thai their oscilloscope 
is relatively easy to use. While some oscilloscopes are more 
user-friendly than others, the general feeling is that oscillo- 
scopes are easier to use than logic analyzers. The primary 
reasons are twofold. First, oscilloscopes usually have direct- 
access controls, something not always true of logic analyzers. 
Second, oscilloscope displays are highly interactive because 
of their fast screen update rates. After all, they are primarily 
waveform viewing tools, and they feel very responsive. The 
HP 54620 leverages these two oscilloscope ease-of-use at- 
tributes and adds some standard oscilloscope features, pre- 
viously unavailable in logic analyzers, thai make the HP 54620 
feel and run like an oscilloscope. The project design goal 
statement was. "Any user who can use an analog oscillo- 
scope should quickly be able to use this new class of logic 
analyzer." 

Hi reel -access control of instruments means that the most 
common controls are available on the front panel, through 
dedicated knobs or buttons. With an oscilloscope, the user 
often reaches up to expand or contract the time window, 
often without thinking. This intuitive control is beneficial 
because it allows the user to concentrate on the problem at 
hand and not worn' about running the test instrument The 
front panel of (he IIP 51620. shown in Fig. 1. has direct con- 
trols for time per division, waveform position, and time base 
delay. Instruments that contain sev eral layers of menus 
under each front panel key can often confuse the user. The 
I IP 54020 addresses this concern by keeping most menus to 
a single level under a given front panel key. allowing the 
user more direct control of the instrument. 

The ability to display unexpected or undesired waveform 
conditions with a highly interactive display often leads the 
troubleshooter to a quicker solution of the circuit problem. 
Instrument displays With fast update rates seldom miss in- 
frequent or random events. If the update rate is slow, say 
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Fig. 8,TheHP54tS0!8wulti- 
processor architecture yields a 
highly interactive display thai 
updates I lie screen about If! 
limes per second as a result of 
parallel professing and acqiusi- 
lion of the waveform data. 



less than five updates per second, the instrument can fail to 
capture important waveform events because it is processing 
and not acquiring data when they occur. The IIP 54620's 
multiprocessor architecture (Fig. 2) yields a highly interactive 
display that updates the screen about 16 times per second as 
a result of parallel processing of the waveform data. 

An important element of oscilloscope operation is how re- 
sponsive it is to control changes. After a control such as 
the lime per division is changed, the instrument should re- 
spond quickly to prevent user confusion. The multiprocessor 
architecture of the HP 54620 helps address instrument 
responsiveness. 

To make the HP 54620 feel and run like a oscilloscope it was 
necessary lo add a number of features that are standard in 
digilal 0SCiIlOSC0p.es. One of the advantages that digital 
oscilloscopes have over analog oscilloscopes and traditional 
logic analyzers is the ability to do automatic measurements 
such as signal frequency, period, duty cycle, and so on. 
Since the waveforms are stored in memory, the oscilloscope 
can poslprocess the waveform lo determine the frequency 
of the stored signal. Oscilloscope users have come to expect 
these measurements in digital oscilloscopes. Consequently, 
automatic measurements for frequency, period, duty cycle, 
positive width, and negative width are implement eil in I he 
IIP 54620. 

Delayed sweep capability is considered standard in the minds 
of some digital oscilloscope users and therefore is imple- 
mented in the HP 54620. Delayed sweep is the ability to split 
the instrument screen horizontally to show the waveform at 
two different time-per-division settings concurrently, Delayed 
sweep is valuable because of the additional insight that is 
often gained from the ability to see the waveform at different 
levels of abstraction. Fig. 3 shows many cycles of a pulse 
train in the main window with a specific cycle demonstrating 
im error in I he delayed window. In this case the ability to 
view the big picture along with specific "little picture" detail 
gives additional information about the waveforms under 
test. 

Just Enough Features 

When deciding on the feature set of an instrument that is 
supposed to be easy to use. the team faces a dilemma. Too 
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Oscilloscope/Logic Timing Analyzer Synergy 



Logic analyzers determine whether an input is high or low based on 
comparison of the mput voltage with a reference or threshold voltage. 
The logic analyzer looks at the input signals the same way digital hard- 
ware m the device under test does What is displayed on the logic ana- 
lyzer screen, in theory, is what the digrtal hardware in the system sees 
One technique that can be applied when using a logic analyzer is to take 
the same measurement several limes with slightly different threshold 
voltages and compare the timing diagrams captured by the logic analyzer. 
Different integrated circuits see the logic transitions (0 to 1 or 1 to 0) at 
different voltage levels For example, a TTL gate may switch anywhere 
between 0.8 volts and 20 volts By taking the measurement with slightly 
different logic analyzer thresholds, the user can often identify signals 
that are not being driven to the correct voltage level The user can also 
get a feel for what the system timing would be at different temperatures 
and across different production runs This gives some insight into timing 
variances that may cause problems down the road. 

Companion to an Oscilloscope 

Users often mistrust the logic analyzer because they feel it doesn't give 
the "real" picture of their waveforms In this case, the user is referring to 



the lack of analog content m the waveform displayed by the logic ana- 
lyzer On the other hand, the oscilloscope can show the waveform's 
analog content but the user can't trigger on tne event of interest Ideally, 
the solution is logic analyzer triggering with oscilloscope waveform 
viewing The HP 54620 logic timing analyzer is optimized to be a com- 
panion to the user's oscilloscope, not to replace it This analyzer has 
both a trigger input and a trigger output on the front panel The user 
can trigger it from an oscilloscope or use the HP 54620 as a trigger 
source to an analog or digital oscilloscope Fig 1 shows the HP 5462C 
triggering on a complex set of digital signals that combine to form a 
complex analog signal This is common in mixed-signal embedded 
systems that contain either a digital-to-analog converter or an analog- 
to-digital converter Often, the analog signal is difficult to trigger on 
with an oscilloscope This problem is solved by looking at the digital 
side of the signal and triggering on a specific pattern By using the two 
instruments together, the user can be more productive by getting a 
view of the situation that neither instrument alone can provide. 




Fig. I. The HP 54620 logic timing ana- 
lyzer can trigger on a complex set of 
digital signals and provide its trigger 
signal to an oscilloscope to display a 
resulting analog signal. 



many Features make the instrument hard to use because 
each additional feature adds more controls. ( )n the other 
hand, too few features dictate thai just when progress is 
being made the instrument's reduced capability becomes an 
obstacle. The optimum point strikes a balance between fea- 
tures and user interface complexity. The design goal in I he 
HP 54620 was "just enough features — an easy-to-use insl ru- 
men! with the capability I need to <' my job the majority of 
the lime." 

The best example of just enough features in the HP 54620 is 
in the area of triggering. The extent of trigger functionality 
in traditional logic analyzers is broad. Triggering capability 
ranges from the most basic edge on a single channel to the 
ability to have up lo ten sequence levels in a user-definable 
State machine, using pattern recognizers, range recognizers, 
timers, and event counters. Conversely, the most basic trig- 
gering available and sometimes the only triggering available 
on oscilloscopes is the ability to trigger on a rising or falling 
edge a' a given vollage on a specific channel. The IIP 54620 
needed in Hud a balance between a logic analyzer and an 
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oscilloscope since it is supposed to be an oscilloscope-like 
logic analyzer. The design team did several rounds of cus- 
tomer visits to understand the target customers, their needs, 
and I heir frustrations with traditional logic analyzers before 
finally arriving at the triggering feature set. 

The design team broke the triggering down Into three broad 
categories. First and simplest is edge triggering. This is the 
logic analyzer version of basic oscilloscope edge triggering. 
In this category the user selects a channel and an edge 
polarity. No voltage value is set because it is implied in the 
logic threshold. 

The second category' is simple pattern. Simple pattern trig- 
gering is denned as highs, lows, and don't cares specified 
across the input channels. The analyzer I riggers when the 
specified pal lent occurs. Optionally, this simple pattern can 
be combined with a rising or falling edge on a single channel. 
This optional edge gives the user the ability to qualify the 
specified input pattern. 

The HP 54620 use model called for users to use edge or sim- 
ple pattern triggering, or both, 80% of the time. Therefore, 
these trigger categories are accessible through buttons on 
the front panel. The user switches (rigger category- by simply 
pressing the front-panel key. 

The third category of triggering is advanced. This category' 
of triggering is in place for the 20% of the time when edge or 
simple pattern is not powerful enough. In just -enough- 
features thinking, the added capability of advanced trigger- 
ing ensures thai the instrument is not underpowered The 
advanced triggering feature set is best described by the 
overview screen shown in Fig. 4. Advanced triggering con- 
sists of two pattern and two edge resources that can be 
combined with the logical operators such as AND. OR. and 
THEN. In addition, pattern duration and edge occurrence trig- 
gers are available. While this advanced triggering capability 
is far below that provided by the most powerful logic analyz- 
ers, customer research showed that this feature sel would 
be acceptable in the majority of situations. 

Doing the Right Thing 

Customer research on waveform viewing tools indicates 
I hat these tools are used for bursts of activity separated by 
weeks or months of inactivity. This leads to customers re- 
learning the tools for each burst of activity. "Doing the right 
thing" is an attempt at reducing or eliminating I he relearn 
time. Research has shown that when an interface control 



docs what tht' user expects, it need not be explicitly remem- 
bered. Instead, users remember or attempt to remember 
controls that are exceptions to what they expect. Unfortu- 
nately, user interface exceptions are often forgotten and 
sometimes remembered incorrectly, resulting in frustrated 
users who can't gel the ins) rumen! lo do what they want. 
Doing the light thing means I he instrument responds as ex- 
pected, perhaps even delighting the user by doing exactly 
what the user had in mind in a given context. 

IIP 54020 customer research also determined that users 
need to be able to label the sixteen input channels so as lo 
heller understand the onscreen signals. Since the HI' 51020 
has no computer keyboard this presents a user interface 
problem. Our solution was to come up with a labelmaker. 
The labelmaker needed the ability to combine characters to 
form meaningful signal names. The labelmaker. shown in 
Fig. 5, provides control of character selection with a knob 
on the froiu panel. Softkeys are used to position the cursor 
and manipulate the label under Construction. 

Even with our best design effort, customers found inputting 
labels to be tedious. Several changes were then made to the 
labelmaker to make il do the right thing and delight the custo- 
mer. The first change was the inclusion of a label dictionary. 
This nonvolatile label list of 75 entries is remembered when 
the instrument is turned off and is seeded by the factory 
with a number of standard signal names. Research indicated 
that many customers were using the same label names 
based on which microprocessor or microcontroller they 
were using. The label dictionary then becomes customized 
to the user's labels over time. As the user constructs a new- 
label, it is saved in the label dictionary and the least recently- 
used label is remov ed. Using the label dictionary dramatically 
reduces the number of labels that need to he constructed 
using the labelmaker, making I he product much easier to 
use. This shortens the setup time of the instrument and 
leads to more understanding of the signals on the screen. 

The second change in the labelmaker has been identified as 
a "delighter" by customers. In defining labels for input sig- 
nals, the user is often looking at a bus such as an address or 
data bus. A typical naming convention for buses is ADDRO, 
ADDR1. A0DR2. and so on. The prefix can change but the num- 
ber at the end defines the specific bus bit. In the labelmaker 
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the user assigns the constructed label to the selected chan- 
nel by pressing the Assign Label softkey. In the normal case. 
Assign Label simply does the assignment to the channel, not 
changing the edit field or the selected channel. In the case of 
a bus assignment, thai is. when the new label ends in a num- 
ber, the Assign Label softkey has a different behavior. It urn 
only does the label assignment, but also increments the 
selected channel and increments the label in the edit field to 
the next logical bus bit. For example, if I have just construc- 
ted a label named DATAO the Assign Label softkey will assign 
that label to channel 0. Since this is the define bus case (the 
label ends in a number), the new label in the edit label field 
will be incremented to DATAI and the selected channel will 
increment to channel !. Thus, to define a bus, the user need 
only press the Assign Label softkey rej>eatedly to label the 



entire bus. In addition, the label dictionary' is smart: only the 
first bus label is saved in the dictionary so that the entire 
dictionary' is not filled with bus label entries. 

Conclusion 

The HP 54630 shows how contributions can be made in user 
interfaces and optimized feature sets while leveraging exist- 
ing Hewlett-Packard technology. Customer research and 
usability testing are key ingredients this process. 
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High-Sample-Rate Multiprocessor- 
Based Oscilloscopes 



The HP 5461 5B and 5461 6B oscilloscopes blend proprietary high-speed 
sampling technology with the power of digital signal processing and a 
proven user interface to deliver usable advanced characterization 
capability. 

by R. Scott Brunton 



The design of I he HP 546 15B and 546 1613 oscilloscopes 
(Fig. 1) focused on adding higher sample rate and extended 
memory depth to the attributes of the HP 54600 product 
family. ' Increasing the sample rate to 1 GSa/s and 2 GSa/s, 
respectively, broadens the confidence that narrow signal 
transients will he acquired and, combined with very respon- 
sive front-panel controls, presents a visually dense image of 
the acquired waveform. To provide reliable acquisitions over 
even the slowest time base settings, special hardware can 
be engaged to detect and capture peaks as narrow as 1 ns 
in width. 

Successful development depended on retaining the person- 
ality of the HP 54600 family by projecting "the feel of analog 
and the power of digital" along the many dimensions of the 
customer use model. 
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Fig. 1. The HP 54*1 W,H •>sr-illosf<ipc (top) and the HF 5461513 
oscilloscope (bottom) offer higher sample rates and extended 
memory depth. 



Product Description 

The HP 54515E3/10B represent variants on the same soft ware 
platform. Pursuing this commonality in the underlying soft- 
ware allows efficient delivery of the features and strengths of 
previous HP 54600 products. To ensure the analog feel of the 
user interface, fast display update and crisp display quality 
were cornerstone metrics. Bezel-mounted softkeys facilitate 
access to advanced control features, and front-panel controls 
are derived based on the function and operation of classical 
analog oscilloscopes. Each of the two independent channels 
delivers a system analog bandwidth of 500 MHz. 

Behind the CRT display system and familiar front-panel 
design is a microprocessor-based acquisition plat form that 
centralizes the overall scheduling of acquisitions and subse- 
quent data filtering and abstraction. Instrument control is 
through user front-panel actions or remote control via the 
HP-IB (IEEE 488, IEC 625) or RS-232. Printers are supported 
to provide hard-copy imaging of acquisitions and setups. 

Optional expansion modules deliver seamless access to 
additional postprocessing capabilities and full-featured I/O 
connectivity. Standard 1-ns digital peak detection, advanced 
measurement functions, and the ability to view events 
occurring before a trigger permit accurate and detailed 
characterization of target system designs. 

HP 54615B/16B Architecture 

Extending the architecture developed for earlier product 
family members, 1 these new products leverage proprietary 
technology blocks to deliver added performance. The im- 
proved architecture is represented in Fig. 2. Waveform up- 
date rate and instrument performance are increased through 
the delegation of primitive tasks to slave processing ele- 
ments. Three processors are used. Each has specific tasks 
and responsibilities that result in a parallel processing boost 
in responsiveness and a reduction in the "blind" time between 
acquisitions. 

Host Processor 

The MC68000 host microprocessor is responsible for the 
general control and operation of the product. It is the sole 
provider of service to the front panel, which is of high priority 
because instrument responsiveness is directly associated 
with this processor's performance. 
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Fig. 2. Architectural block diagram of the HP 54615B and 54G10B 
oscilloscopes. 

In addition to this task, the host maintains communication 
with and dispatches tasks to the other two processing 
blocks, where automatic measurements are performed on 
acquired data and interaction with the optional modules 
takes place. 

During an acquisition, the host processor also tracks the 
current run state of the instrument and coordinates the sub- 
sequent software processing. 

Video Display Processor 

Once data has been acquired and placed into shared memory 
for display, the video display processor postprocesses the 
data record. Data is translated front a time-correlated, 
sequential organization to one that is appropriate for the 
display controller. In addition, vector processing on raw 
data can be performed. This dedicated display processing 
resource results in a net reduction in the caplure-to-display 
time. 

Digital Signal Processor 

The T1320C50 digital signal processor (DSP) acts like an 
acquisition subsystem controller and data processor. It is a 
slave to the host MC68000 from which it receives instrument 
stale informal ion. The DSP manages the time base and per- 
forms sample rate generation. Once a valid trigger is recog- 
nized and data is acquired and unloaded from the dedicated 
front-end acquisition system, the DSP performs time correla- 
tion in preparation for postprocessing. 

Working at cycle limes far shorter than that available on the 
host MC68000, the DSP. depending on the acquisition mode, 
perforins record averaging and primary vector compression. 
This processed dala record is transferred into a shared 



memory segment and control is passed back to the host pro- 
cessor. Of significance is the fact that the video display pro- 
cessor accesses this same shared memory segment to trans- 
fer the newly acquired record into video RAM without any 
support from the host. Thus, acquisition-to-display time is 
markedly superior to methods that require host involvement. 

Completion of the acquisition cycle is communicated to the 
host and. if necessary, the trigger and front-end hardware 
are reprogrammed and armed for the next acquisition. 

Acquisition Front End 

The HP 54615B/16B gain many of their capabilities by lever- 
aging hardware technology from other members of the HP 
54600 product family. The combined analog attenuation and 
digital sampling system delivers a signal-input rise time of 
700 ps and a sampling rate of 1 or 2 GSa/s. This produces an 
effective repetitive bandwidth of 500 MHz on both products 
and a single-shot bandwidth of 250 MHz on the HP 54615B 
and 500 MHz on the HP 54616B. Through the use of inte- 
grated hardware. 1-ns peak detection is achieved over all 
time base ranges. Following the front-end hardware is a 
proprietary memory system that is configurer! to provide 
5000-point acquisition rec ords for each channel. 

These features permit a high sustained sample rate over a 
wide number of time base ranges. Engaging the hardware 
peak detection capability is an effective defense against the 
possibility of missing fast transient input signals. 

Other Capabilities 

An integrated internal calibration port enhances the ease of 
use and reduces the cost of ownership of these products. 
The CRT. the front-panel keyboard, the internal system 
memory, and other hardware subsystems are easily tested 
without the need for additional test equipment. Both vertical 
and horizontal representations of the acquired data are 
calibrated for variations in hardware components and the 
I hernial environment. 

Additional measurements, enhanced functional capability, 
mask testing. FFTs, additional nonvolatile memory for setup 
and I race saving, and expanded I/O connectivity can be pro- 
vided through the use of attachable option modules. 

Software Block Diagram and Theory of Operation 

The software environment can be though! of as being 
assembled from a number of islands (i/conlnil. With the 
host MC68000 acting as a central dispatcher, tasks are deliv- 
ered to any of several subcontrollers. Each subcontroller 
retains independent slate information regarding the underly- 
ing hardware, In the IIP 54615B/10B products, there are 
three such Islands of control. 

The first is the MC68000 itself. In addition to running the 
operating system, the host processor is responsible for dis- 
patching messages related to main and delayed time base 
sweeps for performing data abstractions on acquired data. 
In particular, automated measurement calculations on 
acquired data, mask testing. FFT computation, and function 
evaluations are performed by the host. 
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Wlifii tin- stair of the host instrument dictates that messages 
be dispatched to neighboring islands, two methods are em- 
ployed. The first is a time-sliced access method in which 
access to a particular island is guaranteed during a given por- 
tion of lite overall instrument cycle time. During this time, 
information ran be exchanged and island stale variables 
affected. An example of this type of dispatch is the video 
display processor. When vectors are enabled or disabled, the 
video display processor is informed of this change of state, 
hi turn, the video display processor effects the necessary 
changes in the hardware under its control in an autonomous 
manner. 

The second method of message dispatching is based upon a 
traditional message/interrupt scheme. The host places a 
message in shared memory, which by protocol is guaranteed 
not io be corrupted by other islands, and then semis a hard- 
ware interrupt to the neighboring island controller. Detection 
of the message is asynchronous; the host will not initiate 
subsequent message/interrupt cycles until a handshake from 
the first is recognized. The DSP is an island that operates on 
this premise (Fig. 3). For example, when the 1-ns peak 
detection capability of the instrument is enabled, several 
message/interrupt packets are built to make the necessary 
changes to the DSP's internal state variables and tables. 
Acquisitions are then restarted in the new mode. 

Because the host island is running a multiprocess operating 
system, it can be thought of as several smaller island pro- 
cesses. In addition to the one responsible for communicating 
with the DSP Island, others manage the front-panel keyboard 
;uid remote control interfaces. For example, when user 
interacton is detected by the host, reading of the keyboard 
state is performed by means of memory mapped hardware 
located within the address space of the host system RAM. 



Each island has specific responsibility for its respective 
underlying hardware subsystem. Software driver/hardware 
pairs exist within each island and are unique. For example, 
the DSP island contains a specific driver to control and 
operate the integrated time base generator hardware. 

Code Reuse 

An important part of the effectiveness of this development 
was the the application of software code reuse. An expanded 
and enhanced printer library was exploited without change 
to accelerate the dev elopment. Taking advantage of soft- 
ware that is ripe for reuse has improved the reliability of this 
family of instrument products. Consistent with the goal of 
providing familiar functions throughout the product family, 
this effort continues. 

Summary 

Through the application of task-specific hardware and soft- 
ware subsystems, the analog feel of the user interface has 
been retained while extending the envelope of performance. 
The notion of islands of control with autonomous operation 
effectively permits improvements in responsiveness, while 
an emphasis on software code reuse improves maintainability 
and the leveraging of existing technology. 
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A Dielectric Spectrometer for Liquid 
Using the Electromagnetic Induction 
Method 

Key parameters of colloids are often directly related to or can be derived 
from permittivity or conductivity. Dielectric dispersion analysis (dielectric 
spectroscopy) yields insights into colloidal properties. A dielectric meter 
using a new sensing technique has been developed. 

by Hideki VVakamatsu 



Dielectric spectroscopy is useful for Hip characterization of 
colloidal dispersions. In the past, dielectric spectroscopy 
has been attempted using parallel literal electrodes to mea- 
sure the permittivity and conductivity of colloids. However, 
it is difficult in practice to make these measurements in con- 
ductive solutions because large measurement errors can be 
caused by electrode polarization, which is a kind of contact 

impedance between the electrode and the solution, Electrode 

polarization behaves like a capacitance and masks the true 
properties of the solution at low frequencies. 

The HP K:i(|!5()A colloid dielectric probe was developed for 

colloidal liquid evaluation, its electromagnetic induction 

technique eliminates the electrode polarization effect. The 
probe operates from 75 kHz to :«) MHz with the HP -12S5A 
precision U'R meter and IIP VEE software on an IIPYcctra 
or compatible computer ( Fig. I ), Tin- IIP VEE environment 1 
provides easy operation, display, and analysis of the measure- 
ment data. 



Background of Dielectric Spectroscopy 

Colloids are dispersion systems composed of a dispersed 
substance in particulate form in a continuous-phase disper- 
sion medium. There are many types of colloid; some familiar 
examples are listed in Table I. 

Since there are interfaces between the dispersed substance 
and the surrounding dispersion medium m a colloidal dis- 
persion, there can be appended (nonintrinsic) dielectric 
relaxations — typically the permittivity decreases and the con- 
ductivity increases w ith increasing frequency — as a result of 
interfacial polarization caused by charge buildup on the 
boundaries between the different materials. The analysis of 
these dielectric relaxations based on an appropriate theory 
of interfacial polarization provides Valuable information on 
the structural and electrical properties of colloidal particles.-' 
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Colloid 

Shaving Foajn 

Fog, Cloud 
Milk. Mayonnaise 
Butter, Margarine 
Charcoal 
Blood 

Black Ink 



Table I 
Familiar Colloids 

Dispersion 
Medium 

Soapy Water 
(liquid) 
Air (gas) 
Water ( liquid ) 
Fat, Oil (liquid I 
Carbon (solid) 
Serum (liquid ) 

Water (liquid) 



Dispersed Substance 

Air, Propane (gas) 

Water (liquid) 
Fat, Oil (liquii!) 
Water (liquid) 
Air (gas) 

Erythrocyte (micro- 
capsule) 
Carbon Black 
(solid) 



The frequency characteristics of the permittivity and con- 
ductivity of colloidal solutions are especially informative. 
Fig. 2 shows some examples. A practical means of measuring 
these characteristics — that is, practical dielectric spectros- 
copy — would be a significant contribution to the study of 
the stabilization of dispersion systems and product quality 
control. 



Dielectric Measurement of Colloidal Solutions 

Traditionally, permittivity is measured with parallel metal 
electrodes. This technique can be used to measure noncon- 
ducting (nonionic) solutions such as oils or alcohols. How- 
ever, in the case of salty or ionic solutions that have high 
conductivity, this method suffers from large measurement 
errors. Fig. :ia shows the mechanism responsible for the 
errors. Electrode polarization results from the electrical 
double layer bet ween the metal electrode surface and the 
ionic solution. This becomes a serious problem at low fre- 
quencies because electrode polarization is a capacitance, 
SO the contact impedance is large at low frequencies. If the 
total impedance consisting of the solution impedance and 
the electrode polarization impedance is counted as the solu- 
tion impedance, a large error is incurred at low frequencies 
(Fig. 3b). In other words, the increase of the contact imped- 
ance at low frequencies masks the true properties of the 
solution. 

The second reason why il is difficult to make permittivity 
measurements is as follows. Often in aqueous colloid spec- 
troscopy, the permittivity is a minor part of the admittance 
compared to the conductivity. This means that il lakes a 
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highly precise measurement to extract the small capacitive 
permittivity component from the almost entirely conductive 
admittance. For example, in 0.1% aqueous NaCl (t> = 80, 
k = 0.2 S/m), the magnitude of the susceptance (the permit- 
tivity part) is only about 1/500 that of the conductance (the 
conductivity part.) at 100 kHz. If 1% accuracy of the permit- 
livity analysis is required, the measurement system must 
measure the argumenl of the complex admittance accurately 
within 20 microradians. 

Nonelectrode Electromagnetic Induction Method 

If the electrode polarization could be removed, I he true im- 
pedance of the solution could be measured. To eliminate the 
electrode polarization, a measurement technique without 
electrodes has been implemented. In ibis method, a closed- 
circuii current is made to flow in the solution by electromag- 
netic induction. This current is then measured, also by elec- 
I romagnet ic induction. 

Fig. -I illustrates the principle of this measurement tech- 
nique. When a pair of toroidal coils are immersed in the 
solution, the solution completes the circuit between the 
coils. When the primary coil is excited from the ac signal 
source, a magnetic field in the primary coil induces an elcc- 
Iric field around the primary and secondary coils, and the 
electric field causes a closed-circuit current in the solution. 
This closed circuit current induces a magnetic flux in the 
secondary c oil and the magnetic flux induces an electro- 
motive force in the secondary coil. If the current flowing in 
the secondary coil is measured, the current in the solution 
can be determined. Since the magnitudes of the currents in 
the solution and the secondary coil are proportional to the 
closed-circuit solution admittance, the conductivity or permit 
livity of the solution can be calculated from the measured 
secondary current and primary voltage. 



The induction method avoids the electrode polarization 
effect and has been used in some conductivity meters suited 
for liigli-conducthiry solutions. However, while these meters 
can measure conductivity, they cannot measure permittivity 
because of the difficulty of measuring the small capacitive 
part of the admittance, as discussed in the last section. The 
capacitive part (i.e., peniiittivity) of the admittance is much 
smaller than the conductive part for most aqueous solutions 
When a detection device such as shown in Fig. 4 is used to 
measure a capacitance, measurement errors are produced 
by undesired coupling between the primary and the second- 
ary. This coupling includes leakage magnetic flux coupling, 
stray capacitance coupling, and acoustic coupling caused by 
magnetic distortion of the magnetic cores. All of these are 
severe problems even if the amount of coupling is small 
because the secondary current corresponding to the permit- 
tivity is extremely small, even for liquid that lias a relatively 
large permittivity, such as water. This current is often less 
than one millionth of the primary current at low frequencies 
(e.g., < 100 kHz). 

We have developed a new probe and system to solve the 
problems discussed above. As mentioned earlier, the probe 
operates from 75 kHz to 30 MHz with the HP 4285A precision 
LCR meter and HP VEE software on an HP Vectra or com- 
patible computer. The HP 4285A applies IV to the primary 
coil, achieves accuracy on the order of 50 microradians or 
200 picoamperes in the secondary coil current measurement, 
and then outputs the vector ratio of the current to the pri- 
mary voltage, which is measured remotely (Fig. 5). The per- 
mit livity and conductivity, which are functions of the vector 
ratio, are calculated from the measured vector ratio by the 
software in the computer. 

Probe Architecture 

The induction method completely eliminates electrode polar- 
ization However, some improvements to reduce the unde- 
sired couplings were necessary for practical use. 

The first prototype of the probe had the structure shown in 
Fig. (i. The interior of this probe consists of two toroidal 
coils and electrostatic shields, which are c overed with 
epoxy resins. This prototype yielded knowledge in three key 
areas. 
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First, although extremely delicate, permittivity measure- 
ment is possible. In spite of the tight shield between the pri- 
mary and secondary coils, the residual admittance from the 
undesired couplings can be larger than the effective admit- 
tance from the permittivity of water (Fig. 7). However, this 
residual admittance can be canceled out with the assistance 
of software corrections. 

Second, an asymmetrical electric field produces an error if 
there is a ground loop. When measuring solutions on a beaker 
scale, where the solution is nol grounded, there Is no problem 
using the structure shown in Fig. 6. However, if the solution 
container is on a tank scale and the solution is nearly 
grounded, a new problem arises. The asymmetrical field 
such as the one shown in Fig. G produces unbalanced volt- 
ages on the symmetrical shield structure. This common-mode 




Solution 



Fig. 6. Structure of the firs! prototype probe. 





Fig. 5. Simplified schematic 
diagram of the probe anil LCK 
meter. 

voltage produces a ground-loop current, which is measured 
like an offset capacitance (permittivity). This offset phenome- 
non is very critical because it is unstable and often larger 
than the permittivity of water. To eliminate this offset effect, 
the probe structure must generate a balanced field. 

Third, acoustic (microphonic) coupling also produces 
severe problems. The shape of a ferrite core that has a large 
permeability is geometrically transformed by a magnetic 
field. Vibration of the magnetized core produces an electro- 
motive force in the coil wound on the core. When the pri- 
mary coil is excited from the ac signal source, the primary 
core vibrates. The secondary core picks up this mechanical 
vibration and produces an electromotive force in the second- 
ary coil if the secondary core is magnetized. This mechanism 
spoils the isolation between the primary and the secondary. 
Since the toroidal cores have many resonant modes of 
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Fig. 7. Effect of roiipling on secondary coil current in water. 
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Fig. 8. Final probe structure. 

mechanical vibration between 100 kHz and 1 MHz, they pro- 
duce sharp peaks of residual admittance at specific frequen- 
cies (Fig. 7). This residual admittance cannol be canceled 
out by corrections because it is unstable: its magnitude and 
frequency depend on the temperature, the magnetization 
state, and other factors. To prevent (his coupling, a suitable 
buffer material is required between the primary and 
sec ondary cores. 

On the basis of the above knowledge, a new probe structure 
was developed. Fig. 8 shows the final structure. 

Accuracy Enhancement: Calibration and 
Compensation 

Permittivity measurement is extremely delicate. Corrections 
are very important to extract the small capacitive component 
from the almost entirely conductive admittance and to cancel 
out the residual admittance of the probe. 

Open/Short/Load (OSL) Calibration. The equivalent circuit of 
the probe with the LCR meter is shown in Fig. 9. The solu- 
tion's loop admittance Y x and the LCR meter's measurement 
value Y m are related by the following equation based on gen- 
eral impedance measurement theory: 

v „ K 2 + Y ». 

Yx = Ki rTK^i' (l) 




Fig. 9. Calibration model for probe with LOR meter. 

where Kj. K^, and lyj are frequency dependent complex con- 
stants. The Ki, K2, and Kg are parameters related to the cir- 
cuit of the measurement system and can be derived from the 
circuit constants. However, it is not necessary to analyze the 
details of the circuit, The best way to determine Kb K>. and 
K3 is known as open/short/load (OSL) calibration. The three 
unknown parameters can be calculated from the relationship 
between the values (Y x ) of three known admittances and 
their measured values (Y m ). 

The three known admittances can be prepared solutions, 
but a simpler, more accurate, and more stable calibration 
method is used in our system. The calibration kit shown in 
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Fig. i L Erroi model Involving straj capacitance < 

Fig. 10 was developed for the OSL calibration of the probe. 
The kil consists of 50-olwi arid ()-ohm one-porl standards 
and an adapter thai links the standards with the probe ring. 
For the load stale the probe is placed in the adapter con- 
nected to the fill-ohm standard, for the short slate the probe 
is placed in the adapter connected to the 0-olun Standard, 
and for the open state the probe is placed in air. 

When measuring a solution under lest the following relation- 
ships exist between the admittance V\ and the conductivity 
and permittivity. 

Conductivity: k = K ( .,.||Re( Y v ) (S/nt) (2) 

Permillivity: e = K,.,.||lm(Y x )/ei |F/inJ i-i) 

Relative permittivity: E r = e/e„, e„ = S.85 x lfr 12 FViu. 

K,.,.]i [nr'| is a frequency independent, real-number conslant 
related to the probe dimensions. Re(\\) and Im(Y x ) are the 
real and imaginary parts of Y v and (0 is angular frequency. 
Measurement of a known solution (e.g.. KG in water) can 
determine the K,.,.n value, which needs to be obtained only 
once in the developmental stage of the probe. This is because 
the probe is molded, and the probe-to-probe variations in its 
dimensions are insignificant. Differences in the dimensions 
of the probe do produce errors in the amplitudes of the mea- 
sured values, but the value of primary importance for mea- 
suring permittivity in conductive solutions is the argument 
of the complex admittance rather than the amplitude. 

Compensation. In theory the ( iSL calibration should prov ide 
all the correction that is necessary. However, a problem was 
discovered when the conductivity and permittivity of a stan- 
dard KC'l solution were measured after calibration. The K, ,.n 
derived front the real pail (conductivity pari ) of Y x and 
another K, ,,|| derived from the imaginary part (permittivity 
part ) of Y x did not coincide. In other words, while the K,- ( .n 
value should be determined uniquely by the probe shape. 



in practice a K,-,,n value determined this way will give the 
wrong answer for either conductivity or permittivity. For 
example, when using the K,,.n value determined for conduc- 
tivity, the calculated permittivity of conductive water was 
usually 30% lo 50% smaller than its true value. Furthermore, 
this error varies depending on the conductivity, permittivity, 
and measurement frequency. To solve this problem, the 
error model shown in Fig. 1 1 w as developed, 

The compensation equation derived from the model in Fig. 1 1 

is: 



Yx-irm* — 



1 + + V„/Y s 



Yx. 



(4) 



where Y x ., n „, is the compensated Y x , Y„ = joiC„, and ('„ is a 
characteristic stray capacitance between the copper shield 
in the probe mold and the surrounding solution. By using 
Y x .| nll . instead of Y x in equation 2 or 3, Ihe inconsistency 
between Ihe K, ,.]] values for conductivity and permittivity 
can be improved from 50% to less than 2%. Fig. 12 illustrates 
the efficacy of Ihe compensation in permittivity. 

The value oft',, is determined easily after determining Ihe 
K,,.|| for conductivity. A solution with a known permittivity 
(e.g.. water E r = 80) is measured, and a (',, value is found 
thai gives Ihe correct permittivity when used in equations :i 
and 1. 

This method would not work if the conductivity and permit- 
tivity were lo interact, since K,,,n and (',, would then be diffi- 
cult to determine independently. However. Ihe effect of C„ 
on Ihe measured conductivity value is insignificant. 
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Fig. 13. Dielectric relaxation of six liquid car waxes. 



conductivities and low permittivities, but the induction 
method should be used for high conductivities. 

The second demonstration is monitoring yeast growth in 
fermentation. Dielectric spectroscopy is a noninvasive tech- 
nique suitable for characterizing living biological cells. Bio- 
logical t ells are enveloped in a plasma membrane that is 5 
to 10 nm thick. The conductivity of the plasma membrane 
of a viable cell is negligibly small compared with that of the 
external medium and the cytoplasm. Since the medium- 
membrane-cytoplasm structure has the structure of a dielec- 
tric material sandwiched between two electrodes, it behaves 
like a capacitor, and the permittivity of a medium containing 
cells is larger than that of an empty medium. The increase in 
the permittivity is proportional to the cell concentration 
(Fig. 14). Thus, the cell concentration in the cell culture can 
be monitored by the dielectric method. 

Advantages of the HP K5050A system for this application are 
as follows. 

The technique is nondestructive and requires no sample 
preparation before analysis. The measurement is quick and 
simple and online monitoring can be easily realized with 
computer control. 

Fermentations use ionic culture media, that is, high-conduc- 
tivity aqueous solutions. The induction probe can eliminate 
the electrode polarization problem and can measure the 
permittivity with Ihe required accuracy. 
The dielectric method counts only viabie cells. Dead cells 
with plasma membranes that are leaky to ions are not polar- 
ized and are not counted. The method is insensitive to turbid 
and colored culture media and carbon dioxide bubbles. 

Fig. 15 shows the relative permittivity change in beer fer- 
mentation at 12 G C. The dielectric online monitoring of beer 
fermentation has succeeded only when the elect rodeless 
method was used. The measurements were made at frequen- 
cies between 0.1 MHz and 3.1 MHz. The increase in permit- 
tivity up to 80 hours is because of the increase in cell con- 
centration and the increase in cell volume. The undulating 
curves at low frequencies found at the early stage of the 



Applications 

There are many researchers using dielectric spectroscopy to 
evaluate collokis. Examples include water-in-oil and oil-in- 
water emulsions, microcapsule suspensions, biomass moni- 
toring in fermentalion. and liposomes. The relations between 
colloidal structures and dielectric relaxations can be ex- 
pressed as equations. By measuring the dielectric properties, 
the researchers hope to measure the structure, geometry, and 
dimensions of dispersed substances in detail and quantita- 
tively. We expect that the HP E5050 probe will help make 
such research practical and efficient by eliminating elec- 
trode polarization. We will demonstrate two examples of its 

application. 

The fust example is liquid car wax. Wax is a water-in-oil 
colloid that has dielectric relaxation characteristics. Fig. 13 
shows the relative permit! ivily and conductivity for six sam- 
ples of wax. The samples can be classified into three types 
of colloidal structures based on their relaxation curves. 
Noisy data Was Obtained for low conductivities. It mighl be 
more suitable lo use ihe metal electrode method for low 
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Fig. 14. Relative permittivity (at 3(10 kHz) as n function uf yeast 
concentration in fermenting beer. 
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fermentation arc caused by highly synchronized yeast 
growth.' 1 Tlic data can reveal not only the cell concentration 
litil also information about cell shape, the plasma, and the 
membrane that is useful to chemists and biologists. 

Summary 

Quantitative Information about the structure of colloidal 
dispersions can be obtained from the frequency characteris- 
tics of their dielectric properties. The practical realization of 
dielectric spectroscopy represents a major contribution to 
the study of the stabilization of dispersion systems and 
product quality control. Because it is nondestructive, dielec- 
tric spectroscopy is also suitable for characterizing living 
biological cells. 

Traditionally, permittivity or conductivity measurements 
have been made with parallel metal electrodes. However, 
for conductive colloids, it is difficult to get enough measure- 
ment accuracy in practice because this method has large 
measurement errors caused by electrode polarization. 



Electrode polarization can be eliminated by using an induc- 
tion method, hut permittivity measurement is still extremely 
delicate. In many situations of aqueous colloid spectroscopy, 
the permittivity is a minor part of the admittance in compari- 
son with the conductivity, so precise measurements are 
required. We have developed a suitable probe structure and 
some correction methods for aqueous colloids. The struc- 
ture realizes the low residual admittance required for accu- 
rate permittivity measurements and also generates the bal- 
anced field that is necessary to solve ground-loop problems. 
The corrections make it possible to extract the small capaci- 
tive component accurately from an almost entirely conduc- 
tive admittance. 
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Emulating ATM Network Impairments 
in the Laboratory 



This article discusses a new product for the HP Broadband Series Test 
System. The HP E4219 ATM network impairment emulator allows 
telecommunication network and equipment manufacturers to emulate 
an Asynchronous Transfer Mode network in the laboratory. 

by Robert W. Dmitroca, Susan G. Gibson, Trevor R. Hill. Luisa Mola Morales, 
and Chong Tean Ong 



ATM (Asynchronous Transfer Mode) is a new networking 
technology thai is currently being deployed for tinie-sensitive 
traffic in high-speed local and wide area networks. Informa- 
tion is encoded into short, fixed-length cells of 53 bytes, 
consisting of 48 bytes of payload and five bytes of header, 
as illustrated in Fig. 1. 

An ATM connection can have multiple virtual connections 
because of a two-part address field in the cell's header — the 
YPI (virtual path indicator) and the VCI (virtual channel 
indicator). These fields identify the channel to which the 
cell belongs and consequently the qualify of service it will 
receive. 

ATM has many advantages over other existing networking 
technologies, such as Frame Relay and SMDS. Its short, 
fixed-length cell allows switches and multiplexers to route 
data directly in hardware, as opposed to the slower, software- 
driven approach requited with variable-length packets. ATM's 
bandwidth-on-demand feature means that a single network 
can carry all types of traffic — voice, video, image, and data. 
In summary, ATM is an efficient, flexible technology with the 
ability to support multimedia traffic at extremely high rales. 

Since ATM network implementation is still in its preliminary 
stages, equipment manufacturers and service providers have 
a tremendous need for test solutions — in particular, a way to 
emulate ATM networks in die laboratory so that services 
and equipment can be tested before deployment. The IIP 
Communications Measurements Division (CMD) has identi- 
fied three major benefits that ATM network emulation can 
provide: risk reduction, design verification, and robustness. 
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Risk Reduction. Full deployment of any network is a very 
expensive proposition, especially since some planned net- 
works are national in size. Even if a network is available to 
test, creating maximum delays or errors is difficult since 
these impairments occur only under congested or abnormal 
conditions. Network providers can use emulation to test 
their networks in a well-controlled environment. 

Design Verification. Rimnmg time-sensitive voice and video 
data over a packet-switching network such as ATM is an 
active area of research. Because ATM networks have in- 
herent delays, encoders must be able to handle both vari- 
able and fixed delays gracefully. Network emulation allows 
equipment vendors such as encoder manufacturers to verify 
their designs in the laboratory in a relatively inexpensive way. 

Robustness. Network equipment manufacturers also need to 
confirm that their algorithms and designs work under stress. 
For example, some early ATM switches had buffers smaller 
than a single TCP/IP datagram. When they used AAL-5 (the 
ATM adaptation layer protocol selected to carry TCP/IP data- 
grams), t he buffers overflowed and dropped cells. Because 
AAL-5 employs a high-layer, end-to-end error correction 
scheme, the AAL-5 packets could not be reconstructed and 
consequently the TCP/IP datagrams were also lost. Almost 
no dat a throughput occurred even when error rates were 
low. Network emulation detects these kinds of unexpected 
behaviors before they become costly mistakes. 

Development Partner 

Because CMD develops state-of-the-art test equipment for 
emerging telecommunication technologies, short develop- 
ment times are crucial. When CMD management made the 
decision to develop a network emulation module, it 
became clear that we would need a development partner 
with more experience in this area if we were going to ship 
on time. 

We were very fortunate to find that Telefonica Investigation y 
Desarrollo (TI+D) in Madrid, Spain had previously conducted 
research in network emulation and was willing to cooperate 
with us to turn their research into a rnanufaclurable product 
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TI+D is the research ami development arm of Spain's national 
telephone system. 

TI+D had developed a network emulator that showed some 
promise as a commercial product. However, the module's 
impairment specifications were not quite right for two rea- 
sons: the module was designed to interface to a proprietary 
TI+D line interface, and it was not designed to be manufac- 
tured in volume. However, we were confident that we could 
use the basic technology of this hoard and redesign it to work 
in the IIP Broadband Series Test System. Development of 
the IIP E4219A network impairment emulator began in the 
fall or 1994. 

Network Impairment Emulator 

The IIP Broadband Series Test System (BSTS) is a compre- 
hensive platform for testing high-speed local and wide area 
networks. Designed primarily for laboratory research and 
early network deployment, the BSTS consists of a VXIbus 
mainframe, an HP-UX* controller, between one and 12 spe- 
cialized VXIbus modules for different physical interfaces and 
convergence layers, and a variety of software applications 
for testing ATM, SMDS. Frame Relay, and other protocols. 

Fig. 2 is a block diagram of the BSTS with three VXIbus 
modules — the minimum required when testing with the net- 
work impairment emulator. The modules in Fig. 2 are as 
follows: 

• The HP-L'X controller, a UNIX* -operat ing-system-based 
X Windows controller, provides the instrument interface 
(GUI and keyboard) and controls all other modules via the 
VXIbus. 

• The network impairment emulator inserts user-specified 
impairments on selected ATM traffic. The network impair- 
ment emulator is a unidirectional product. Two network 
impairment emulators and two line interfaces are required 
to insert impairments in both directions. 



BSTS 




Fig. 2. Diagram of the HP 750(11) Broadband Series Test System 
with network Impairment emulator. 



• The line interface provides the means by which the BSTS 
connects to a line. In normal operation, the line interface is 
placed in the middle of a link. ATM traffic is received by the 
line interface, passed through the network Impairment emu- 
lator, then placed back on the line by the line interface. A 
variety of line interfaces work with the network Impairment 
emulator, ranging from Tl to SONET OC-3. 

The network impairment emulator connects to a line inter- 
face through an IIP proprietary cell bus implemented with 
VXIbus LBUS connections, Using the HP-L'X controller, a 
user can insert into an ATM line a series of impairments that 
fall into two basic categories: delays and errors. 

Delays. Inherent delays are a normal part of the operation of 
any network. ATM networks being no exception. Although 
delay is not especially critical for data communications 
applicat ions, video and voice are highly sensitive to jitter, or 
variable delay. Since one of the most important commercial 
uses of ATM technology will be the transport of digital 
MPEG-2 encoded video, the network impairment emulator 
enables equipment manufacturers such as encoder and set- 
top box designers to verify their designs before they are 
deployed. 

Errors. All network transmissions are subject to errors caused 
by thermal noise, switching equipment, lightning strikes, 
magnetic fields, interference by other channels, and many 
other phenomena. The network impairment emulator can 
ii\ject errors into a line to simulate these kinds of network 
malfunctions. This enables service providers to ensure that 
higher-layer transport protocols can detect errors and restart 
transmissions, to verify that acceptable throughput can be 
achieved under worst-case error levels, and to determine the 
worst-case error rate before video and voice degradation 
become an issue. 

Emulator Design 

The network impairment emulator offers five impairments: 
cell delay variation (CDV), constant cell delay, cell error, cell 
loss, and cell nusinsertion. 

As illustrated in the emulator block diagram. Fig. 3, cells 
flow from the cell bus, through the impairments, and back to 
the cell bus. Impairment modules can impair all or a portion 
of the ATM cell stream. 

The cell bus interface takes cells from the "drop" side of the 
four-bit EC'L cell bus and converts them into eight-bit TTL 
signals. It also receives cells from the last Impairment in the 
chain and places them back onto the "add" side of the cell 
bus. DT[7..0] are the eight signal lines, CS is a cell start indica- 
tor, which goes active at the start of a cell, and EC is the 
empty cell indicator, which indicates whether the cell con- 
tains valid data. 

The VXIbus is used to control the logic of each impairment 
module. The VXIbus controller sets up impairments as re- 
quested by the user. 

The network impairment emulator mimics various network 
conditions by inserting impairments onto a line based on a 
user-defined statistical distribution. The network impairment 
emulator's statistical information is contained within the 
distribution tables, which hold either normal, binomial, 
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Fig. 3. Block diagram of the HP 
E4219A network impairment 
emulator. 



exponential, geometric, deterministic, or user-defined dis- 
tributions. All distributions are user-selectable. Inside the 
statistical distribution generator block are pseudorandom 
number generators that determine the time intervals 
between impairment events. 

Constant Cell Delay Module 

The constant cell delay module can delay all or a selected 
portion of the ATM cell stream by a fixed amount, ranging 
from one to 79,137 cell times. This means that an OC-3 link 
with a rate of 155.52 Mbits/s will have a maximum delay of 
221) ins, approximately the delay of one satellite hop. 

As illustrated in Fig. 4, cells first enter the write block, 
which determines whether they match the user-specified 
Filter. Delayed (matching) cells are then written to the con- 
stant delay RAM, and nondelayed (nonmatehing) cells are 
written to the to the null delay FIFO. 

The constant delay RAM is divided into four IM-byte blocks 
using 10-bit bytes. For each byte, eight bits store the ATM 
cell data (DT(7..0|), one bit stores the empty cell indicator 
(EC), and the last bit stores the cell start indicator (CS). As 
delayed cells enter the module they are written in sequential 
ordei into RAM. 

The constant cell delay module is very straightforward. The 
write block writes ATM cells (as each is received from the 
cell bus) to I he constant delay RAM. The read block reads 
from memory locations some increment behind where the 
write block is writing, and places the cells found there back 
on the cell bus for transmission. Thus, each ATM cell stays 
in the c onstant delay RAM a specified period of time before 
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it is transmitted. This period of time is an integer multiple of 
the time required to transmit one cell, and is specified by the 
user. 

Cell Delay Variation Module 

The cell delay variation module emulates cell buffering, net- 
work congestion, and multiplexing delays in ATM networks 
by varying the amount of delay applied to cells. Depending 
on the line rate, maximum delays can range from 2.62 ms 
(with a 155-Mbit/s SONET/SDH line interface) to 220 ms 
(with a 1.544-Mbit/s Tl line interface). Each cell in the 
stream is given a different amount of delay. A critical feature 
is that in an ATM network, cell sequence integrity is guaran- 
teed. This posed a technically interesting problem, since 
simply delaying each cell to create the desired statistical 
dist ribution would put cells out of sequence. This problem 
was solved by the use of a Markov chain model, which is 
illustrated in Fig. 5. 

The network impairment emulator's cell delay variation 
impairment module is implemented as a birth-death Markov 
process. '•- Each matching cell (a cell whose header values 
match a user-specified filter) is delayed by a specified 
amount. This amount is determined by the current state of 
the Markov chain. For example, a matching cell entering 
the cell delay variation module when the Markov chain is in 
state i will undergo a delay of i cell time intervals. To pre- 
serve the cell sequence integrity, the state of the Markov 
chain cannot be changed until the arrival of a nonmatehing 
cell (which includes idle or unassigned cells). On the arrival 
of each nonmatehing cell, random number generators are 
used to determine whether the Markov chain remains in the 
same state or changes to a higher or lower state. 
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Fig. 4. Block diagram of the Constant cell delay module 



Fig. 5. Birth-death Markov chain model. The x, and Mi are birth and 
death rates. 
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Consider the cell sequence MiN|No....N|<Mo which arrives ai 
the network impairment emulator when the* Markov chain is 
in slate n. Mj and N, denote the jlh matching and nonmatch- 
ing cells in the sequence, respectively. Furthermore, let Dj 
denote the delay associated with the it It matching cell and T 
be the cell interval time. Note that the value of Di is nTat 
the arrival time OfMj and will change to (n- 1 )Tat the 
arrival time of N[. The value of D| will continue to decrement 
and will lie (n — k - 1 )T at the arrival time of Mo. Next, con- 
sider the possible values of Do. Since the Markov chain can 
only change state at the arrival of a nonmatching cell and 
there are k nonmatching cells between M| and Mo, Do at the 
arrival time of cell Mo is bounded by: 

(n - k)T < Do s (n + k)T. 

Therefore, upon the arrival of cell ,\K Do is at least one cell 
interval more than Dj. Hence, cell sequence integrity is pre- 
served. 

The implementation of the cell delay variation module using 
this process does impose some bounds on the delay of two 
consecutive matching cells. However, this constraint is not 
expected to be significant since actual traffic conditions are 
not likely to change greatly. Furthermore, ATM links are 
typically high-capacity links and the load of each individual 
virtual channel is expected to be small relative to the overall 
channel capacity. This implies that for actual network 
emulation, the percentage of nonmatching cells is expected 
to be much greater than the percentage of matching cells, 
thereby reducing the delay dependencies between consecu- 
tive matching cells. 

The following shows how the sieady-siati' distribution of the 
Markov chain can be used to implement the cell delay varia- 
tion distribution. Only the derivation of the binomial distribu- 
tion with parameters n and p will be shown in detail, where 
n is the total number of independent trials ( or, equivalent ly, 
the maximum number of cell-time delays) and p the proba- 
bility of delay on any single trial. Other distributions can be 
derived using the same technique by changing the birth rate 
/.j and death rate |q appropriately. 

Let Jtj be the steady-stale probability of being in state i and 
let the birth and death rates be defined as: 

K, = (n - i)p 
iq = i(l - p). 

By How balance (i.e.. the rate of moving from state i to slate 
i + 1 in the steady state must be equal to the rate of moving 
from state i + 1 to state i): 

+ 1)(1 - p) = .tj(n - i)p. 

Therefore, 

(" ~ OP 
^'-(i+Dll-p)*'- 

By recursive substitution, the steady-state probability of 
each state can be expressed in terms of no as: 



* = 



P 



1-P/- 



Since the sum of the total steady-state probabilities must be 
equal to 1, that is, 

n 

using equation ( 1 ). we can write the steady-state probability 
of state zero as: 



no = 



1 + 



1-P 



l 'sing this expression for Tin in equation 1. we have: 

*| = (i)p'(l - P.)"-', 

which is a binomial distribution with mean up and variance 
np(l-p). 

Implementation of the Cell Delay Variation Module 

Given a particular distribution, one of the first processes the 
network impairment emulator performs is to calculate the 
state transition probabilities of the Markov chain. These 
probabilities are determined by the birth and death rates. 
The slate in the Markov chain will increase if there is a birth 
before a death. Conversely, the state will decrease if a death 
occurs before a birth. The lime between transitions in stale 
is exponentially distributed with mean + u,). Given 
that there is a change in state, the transition probability is 
k/O-i + m) for an increase in state and Uj/(/.j + Mi) f° r a de- 
crease in state, respectively. These probabilities, which are 
written into the table memory, are used together with the 
random number generator to determine the next stale of the 
Markov chain whenever a nonmatching cell arrives. 

Fig. li shows a simplified block diagram of the cell delay 
variation module. Each incoming cell is examined by the 
write block, then matching and nonmatching cells are written 
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Fig. 6. Block diagram of the cell delay variation module. 
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to the matching and nonmatehing cell FIFOs respectively. 
For each nonmatehing cell, the random number generator is 
used together with the table memory to determine the new 
state of the Markov chain if a state change has occurred, For 
each matching cell, the state of the Markov chain is sent to 
mirror memory, which acts as a control to the read block, 
indicating whether to extract the next outgoing cell from the 
matching or nonmatehing cell FIFO. 

( ontenlion between the matching and nonmalching cells 
may occur at the read block. In such cases, priority is given 
to the matching cell, and the nonmatehing cell is transmitted 
at the next available slot. This is done to preserve the cell 
delay variation distribution set by the user as closely as pos- 
sible. If there is no matching cell to send and the nonmateh- 
ing cell FIFO is empty, the read block generates an idle cell. 

Cell Error, Loss, and Misinsertion Modules 

Recause the cell error, loss, and misinsertion modules have 
similar implementations, they will be described together. All 
impairments for these modules use a statistical distribution 
generator to create the specified distribution. This module, 
illustrated in Fig, 7, contains three pseudorandom genera- 
tors, all implemented using different polynomials to ensure 
that (tie pseudorandom processes are independent. 

The pseudorandom number generator creates 12-bit num- 
bers with a cycle of 2 1 '. The lK-by-10-bit memory stores the 
statistical distribution (or more precisely, the cumulative 
distribution function), with the III bits representing the im- 
pairment range of 10""" to 10 ~ '-. The sequence of events is 
as follows: 

• The statistical distribution is loaded into RAM. 

• The pseudorandom number generator generates an address. 

• The resulting data is latched and then loaded into the down 
counter* 

• The down counter decrements by one as each cell whose 
header matches the impairment filter transits the network 
impairment emulator. 

• When the down counter reaches zero, a flag is set to inform 
the impairment module that the next matching cell should 
be impaired (cell loss and cell error modules) or a cell should 
be insetted into the stream (cell misinsertion module). 

• The zero detector drives the pseudorandom number genera- 
tor to create a new number, and the process repeats. 

Fig. S shows the block diagram of the cell loss, error, and 
misinsertion modules. Cells enter into a header selector. 
The MATCH indicator will become active for each cell flowing 
through tin- network impairment emulator provided that the 
user selects all cells to be impaired. Otherwise, it will be 
active only for selected channels. The MATCH indicator goes 
to the statistical distribution module. 

Cells pass through the impairment module unimpaired if the 
INSERT signal is not active. If the INSERT and MATCH signals 



are both active, the cell will be errored. lost, or replaced by 
an empty cell. 

Both the cell error and the cell loss modules have the added 
capability of introducing bursts of impairments. These impair- 
ments are also specified as statistical distributions through 
the use of a simple probability function. The burst length 
ranges from one to four bits for the cell error module and 
from one to eight consecutive cells for the cell loss module. 
For example, if a user selects a burst length of four with a 
probability of one for the cell error module, each occurrence 
of INSERT + MATCH Will result in four consecutive errors. By 
way of further illustration, setting the probabilit> to I). 125 
for each of the eight consecutive cell loss events will result 
in 12.5% of the INSERT + MATCH events being of each length. 

The cell misinsertion module inserts cells into the ATM 
stream by replacing empty cells with nonempty cells whose 
header (and a part of the payload) are defined by the user. 
As with the cell error and cell loss modules, the distribution 
of the inserted cells is defined by the statistical distribution 
module. 

Implementation 

With the aid of TI+D engineers, we convened the design to 
meet HP's manufacturing standards and then added circuity 
to interface to the BSTS cell bus. The software controlling 
the TI+D network Impairment emulator was rewritten and a 
new GUI created to make the Hewlett-Packard network im- 
pairment emulator control panel appear consistent with I he 
rest of the BSTS products. 

The design work was carried out at two sites: Hewlett- 
Packard's facility in Vancouver, Canada and TI+D's facility in 
Madrid Spain, To coordinate the project and to ensure a 
successful collaboration, we first visited TI+D's site in Spain 
and outlined the changes they would need to make to their 
research hardware. We also specified the interface needed 
between their core ATM impairment logic and our cell bus 
interface. ( luce we both had completed our designs, two 
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TI+D engineers traveled to our site in Vancouver lo help us 
debug the design. 

Circuit Design 

The circuit design had to operate with line interface rates up 
to 15").o2 Mhits/s. Since ATM data is moved in bytes on the 
circuit board, the resulting system clock is lil.-I MHz. This 
limitation in clock speed and the need to insert impairments 
in real time prevented us from considering a microprocessor 
or DSP-based design. The entire network impairment emula- 
tor was implemented using ten large programmable logic- 
parts from Altera and Xilinx and a large number of FIFOs, 
RAMs, and ECL glue logic. For the Altera designs we used 
MAX+PLl'S II software to create a combination of schemat- 
ics and text. We used Mentor's Design Architect for the Xi- 
linx designs. 

Because of time constraints, the entire design was not simu- 
lated as a unit. We used either Altera's MAX+PLl'S II or 
Mentor's QuickSim II to simulate each individual program- 
mable device. The ECL-based cell bus was simulated com- 
pletely using QuickSim II. 

Although this worked reasonably well, a few timing-related 
issues cropped up concerning timing margins between 
F1F< )s, RAMs, and programmable logic parts that took con- 
siderable effort to isolate and correct. In hindsight, a more 
productive approach would be to simulate the programmable 
logic and related RAM components together, rather than 
attempting to debug subtle errors in the lab. 

Applications 

The following are some typical tests users conduct with the 
network impairment emulator. 

Fig. 9 shows two ATM users communicating through a net- 
work that uses a satellite link. Delays from the satellite 
uplink and downlink are relatively large and fairly constant. 
Cell delay variation caused by buffering at the ATM switches 
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Fig. 9. Emulation of a satellite link with the HP Broadband Series 

Test System and the HI' E4219A network impairment emulator. 
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Fiti- 10. Determining acceptable quality of service In a digital 
video system. 

is also likely, as are bit errors and cell loss. Users can emu- 
late all these impairments simultaneously with the network 
impairment emulator by setting the appropriate parameters 
and enabling the constant cell delay, variable cell delay, cell 
error, and cell loss modules. 

Fig. 10 illustrates how the network impairment emulator 
can determine the acceptable level of quality of sendee in a 
video-on-demand application. In this test setup, the video 
server generates an MPRG-2 video transport stream, which 
is then transported by the ATM network. At the receiver end, 
the set-top box decodes the MPEG-2 transport stream and 
displays the picture on the monitor. One of the most impor- 
tant issues in MPEG-2 technology is the recovery of the sys- 
tem clock. In the above scenario, video service providers or 
set-top box manufacturers would be interested in knowing 
how cell delay variation affects the system c lock recovery at 
the receiver end. The network impairment emulator's vari- 
able cell delay module can help determine buffer size and 
other quality of service factors by increasing the standard 
deviation of the delay distribution until the display visibly 
degrades. 

Conclusion 

Hewlett-Packard's third-party collaboration with Telefonica 
Investigacion y Desarrollo of Spain was highly successful. 
HP's understanding of the ATM market and product com- 
mercialization combined with TI+D's technical expertise in 
ATM networking were merged to produce the world's first 
155-Mbit/s ATM network emulator within the targeted mar- 
ket window. 
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A Message Handling System for 
B-ISDN User-Network Interface 
Signaling Test Software 

B-ISDN user-network interface signaling has many different protocol 
variants and each of them has tens of different types of messages The 
message handling system provides a powerful tool for the developer to 
easily support these variants and messages in the HP Broadband Series 
Test System (BSTS). 

by Satoshi Naganawa and Richard Z. Zuo 



Over the pasi several years, as the "informal ion superhigh- 
way" has gained the attention of many countries and com- 
munication network service providers, the focus of the 
Broadband Integrated Services Digital Network (B-ISDN) 
industry has shifted from providing transmission capabilities 
via Asynchronous Transfer Mode (ATM) to providing a vari- 
ety of B-ISDN network services such as private and public 
B-ISDN. switched virtual connections, and LAN emulation 
over ATM. Signaling plays a key role in realizing these ser- 
vices. The key functions of signaling are: 

• Call setup and disconnection 

• Negotiating and agreeing on quality of service (QoS) 

• Providing accounting and charging information 

• Resolving internetwork and intranetwork routing 

• Facilitating b;isic services and supplementary services 

• Identifying the occurrence and cause of any network or 
service failure. 

Signaling consists of user-network interface (UNI) signaling, 
network-node interface (NNI) signaling for public networks, 
and private network-node interface (PNNI) signaling for 
private networks. The I'NI is the boundary between a pri- 
vate or public network and the user equipment. I ser equip- 
ment can be a telephone, a computer, or video conference 
equipment as illustrated in Fig. 1. 




Fin. l< I Iser-nelwork Interface ( I Nl), network node Interface 
(NNI), and private net work-node interface (PNNI I signaling 



The Broadband ISDN I NI signaling lest software. HP 
E4214A, is a new addition to the HP Broadband Series Test 
System (BSTS). It is designed for R&D engineers and net- 
work engineers to develop, troubleshoot. and verify the fea- 
tures and functionality of I'NI signaling. Fundamental fea- 
tures of the I 'NI signaling test software are protocol message 
decoding and encoding. With decoding, the user can monitor 
the message transaction between the user equipment and an 
ATM switch. With encoding, the user can execute a condition 
test such as constructing a correct or incorrect message and 
sending il to a switch lo analyze the response. 

UNI Signaling Standards 

As VIM emerges as the mainstream technology in the lele- 
conimunicalion network market, the standardization of UNI 
signaling has been progressing al a very rapid pace during 
the last several years. Two major standardization organiza- 
tions — the Inlernalional telecommunications I nion Tele- 
comnmnicalions Standardization Sector (ITU-T) and the 
ATM Forum — created their UNI signaling standards in an 
interlocked way. In May 1993, ITU-T published its first draft 
recommendation Q.93B' to support point-to-point ATM 
switched virtual connections. The ATM Forum adapled 
Q.9.1B to support point-to-multipoint connection control, 
variable-bit-rate connections, and the ability to signal quality 
of service (QoS) on a per-caJl or per-connection basis. These 
enhancements were published in the ATM Forum UNI 3.0 
specification- in September 1998. baler, the ITU-T Updated 
Q.93B as Q.2931 3 with some important changes. Following 
those changes I he ATM Forum also updated UNI .1.0 into 
I Nl 3.1 4 in September 1994. In the meantime, the ITU-T 
defined point-lo-mullipoinl switched virtual connection 
features and published its standard as Q.297U in June 1995. 
AbOUl hair a year later, the ATM Forum published I'NI 4.0" 
to incorporate the leaf-initiated join capability and the avail- 
able bil rate i ABR) service in Q.2931 and Q.2971. In addition 
to the protocols promoted by the ITU-T and the ATM Forum, 
some countries and network service providers also created 
their own UNI signaling protocol variants. From this brief 
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Fig. 2. Spuing up a connection 
between two user equipments 
via signaling.' 



history, il ran be understood why there are many UNI signal- 
ing protocol variants. Undoubtedly, more variants w ill come 
soon to support more new features. 

In addition to having many different UNI signaling protocol 
variants, each variant has many different types of messages. 
A message is a protocol data unit (PDU) used in a signaling 
protocol. Each message can contain multiple information 
elements, called IEs. Some IEs must be contained in a par- 
ticular message and are referred lo as mandatory IEs. Some 
IEs ate optional for a particular message, and therefore will 
not always appear in that message type. For example, the 
I NI 3.0 standard- has Hi different types of messages and 21 
different types of IEs. SETUP, CALL PROCEEDING, CONNECT, and 
CONNECT ACKNOWLEDGE are the messages for call establish- 
ment, as shown in Fig. 2. The other categories of messages 
are designed for call clearing, point-lo-multipoint communi- 
calion. and other purposes. 

The situation described above created a challenge for HP's 
BSTS UNI signaling test software developers. The software 
needed to support many protocol variants, with many differ- 
ent types of messages and IEs in each variant. Simultaneously, 
the R&D cycle time was much reduced. We realized that 
hard-coding each message and IE for each protocol variant 
in the decoding and encoding engines would make software 
maintenance a nightmare, would require the programmer to 
change the decoding and encoding engines whenever adding 
a new protocol variant, and would be of little help to us in 
developing NNI and PNNI signaling test software in the future. 



Message Handling System 

The message handling system was developed to solve this 
problem. The message handling system: 

• Separates the structure description of messages and IEs 
from the remainder of the software 

• Specifies the structure descriptions using a prolocol-specifi- 
calion-like language in separate script files 

• Compiles the script files to generate an object file 

< Reads the object file into a database and allows the other 
parts of the BSTS to access those messages and IEs through 
library functions. 

Fig. 3 depicts the architecture of the message handling 
system. 

The message handling system describes a signaling protocol 
in one or more t ext files by means of a script language called 
the Message Description Language or MDL. A compiler con- 
verts the MDL script files into an object file by using the lex 
and yacc utilities provided by the UNIX* operating system. 
Like most compilers, the MDL compiler consists of a parser 
and a code generator. The parser reads an MDL text file, 
checks the syntax, and then puis the information in an inter- 
nal data Structure. The code generator generates an object 
file from ihe internal data structure. 

With ease of use as a top priority, we designed the MDL as a 
simple protocol-specification-like language that allows a 
developer to easily translate a UNI signaling protocol speci- 
fication into a script file. The MDL is also generic enough to 





MDL Files 

MDL = Message Description Language 
MSD = Message Structure Database 
MHS = Message Handling System 
BSTS = Broadband Series Test System 
PDU = Protocol Data Unit 
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handle different types of messages and IEs used in different 
UNI signaling variants. In the future, such as when using the 
message handling system for NNI and PNNI signaling, only 
slight changes are needed in the MDL and its parser. 

Fig. 4 show s what the MDL looks like. This is a very simple 
and artificial example, but one that demonstrates some of 
the most important common c haracteristics of all types of 
messages. This example will also be used later to explain 
several interesting points in the message handling system 
structure database. The octet-field map with notations shown 
in Fig. 4a to describe a message structure is a widely used 
style in the signaling specifications and is easily translated 
into an MDL script as shown in Fig. 4b. 

The message structure database object file shown in Fig. 3 
is designed as a binary disk file version of the message 
structure database. The disk file format lets us easily sepa- 
rate the MDL compiler from the message handling system 
library so that a compiler developer doesn't need specific 
knowledge about the other parts of the test software such as 
the decoding and encoding engines. Unlike the MDL files, in 
which the description for a single protocol can be separated 
into several source files if more convenient, all the informa- 
tion for a single protocol is put into a single message struc- 
ture database object file to make the run-time loading easier. 

The message handling system library in Fig. 3 provides 
application programming interfaces (APIs) for the message 
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{ 



name "simpl_example" ; 

octet 1 required { 

field Field_A[4] Field_A_Prm; 

field Field_B[4] . [12. .9] Field_B_Prm; 

> 

octet 2 required ( 

field Field_B[8] . [8. .1] Field_B_Prm; 

) 



octet 3 Note 1 ( 

field Field_C[8] 



Field^C_Pnn; ) 



rule Note_l { 

if (decode (Field_A) == 0) { 

return required; 
) else { 

return absent; 



) 



) 



(b) 



Fig. 4. A very simple example of n Message Description Language 
(MDL) script, (a) The specification, (b) The Ml 'I- scrip) 



handling system. The other parts of the BSTS can access the 
protocol-specific information stored in the message struc- 
ture database through the library functions. Functions are 
available to: 

• Initialize the message handling system library (this includes 
loading a message structure database object file to generate 
a message structure database) 

• Decode a signaling PDF 

• Construct a default signaling PDF 

• Encode a signaling PDL" 

An example of using the message handling system library is 
displaying captured PDl's. After a signaling PDL" is captured, 
a message handling system library function is called to ex- 
tract the values for all fields by using the protocol knowledge 
stored in the message structure database. Another message 
handling system library function is called to convert those 
values into a proper text format. A playback viewer in the 
other parts of the BSTS can display the formatted decoded 
result on the screen. 

Another example of using the message handling system 
library is construction of a signaling PDF by the user. A 
PDL" editor in the other parts of the BSTS calls a message 
handling system library function to create a PDU, which 
uses the most frequently used values as its default values, 
and shows the PDF on the screen. After the user makes any 
modifications to the PDU, the PDU editor calls another li- 
brary function to put each field value in proper bit and byte 
order in an octet stream, as described by the protocol specifi- 
cation, to construct a new signaling PDU. During the interac- 
tion, the appearance of the user interface can change with 
the user's input. These dynamic changes are also controlled 
by message handling system library functions described 
later. 

Message Structure Database 

The message structure database shown in Fig. 3 is generated 
in the application memory space after a message Structure 
database object file is loaded by the message handling sys- 
tem library, The message handling system library can load 
several different message structure database object files to 
generate several different message Structure databases within 
l he same application. This allows the user to use several 
different protocols simultaneously. 

The message Structure database is a set of data structures 
that define the structure of a PDU and give information 
about how io decode, encode, and construct a PDU. The 
message structure database Ls the core of the message han- 
dling system, and it is generic enough to support not only 
I'M signaling but also NNI and PNNI signaling. The key 
points in successfully building the message structure data- 
base are If) correctly analyze the common requirements to 
express different types of messages and IEs, and lo design 
proper data structures to meet these requirements. Some 
important requirements are: 

• Mapping an octet format of a PDU into a field format or vice 
versa 

• Handling field dependencies 

• Handling predefined values for parameters 

• Automatic inclusion of mandatory IEs and optional IEs 
selected by the user 

• Automatic calculation of message length. 



© Copr. 1949-1998 Hewlett-Packard Co. 



April I!'!'" llrwlell-l'a.kanl Journal 53 



Octei Table 



Mapping Table 



Field Table 



IF* * 


Map a 
Mapl. 
Mapc 


— §> 


Fit-Id A 1 
ReUB 
Field C ■ 




Mapd 





Fig. 5. Mapping the octet formal into the field format. 

In the remainder of this section, the mapping and handling 
field dependencies are used as examples to show in detail 
what these general requirements entail and how the message 
structure database meets them. 

The bit stream in a PDU can be interpreted into either an 
octet format or a field format. A single octet can contain 
part of a field or one or more fields, and in turn, a single 
field can span part of an octet or one or more octets as 
shown in Fig. 4a. The octet format is usually used in general 
computing, such as computing the length of a PDU. The field 
format is more significant when the user is only concerned 
with the value of a field but not with the field's bit posit ion. 
In most cases, such as when decoding or encoding a PDU, 
both formats are important. Implementation requires three 
tables — the octet, field, and mapping tables — for each mes- 
sage in the message structure database. Pig. 5 shows how 
to map the octet format into the field format using these 
three tables for the example displayed in Fig. 4. Each ele- 
ment in the mapping table is defined by the data structure 

mhs_msd_map_r shown in Fig. 6a. The definitions of the vari- 
ables in mhs_msd_map_r are explained in Fig. 6b by showing 



0 










■ 



Field A 
Field B 
Field C 



WhenlieldA - O.lieldCis 
required (sensitive). 




When field A * 0, field C is absent. 



Fig. 7. An example of field dependency. 

how map b in Fig. 5 maps the second part of octet 1 with the 
first part of field B. 

In a signaling message, modifying the value of a field can 
change the definitions or values of other fields. This is called 
a field dependency. Usually the effect of the field dependency 
needs to be interpreted in real time. For the example shown 
in Fig. 4, Note 1 indicates that field C is present when the 
value of field A is equal to 0. and is absent otherwise. There- 
fore, on the user interface of a PDU editor, the sensitivity of 
field C should be changed with the value of field A in real 
time as indicated by Fig. 7. 

The message handling system internal interpreter was devel- 
oped to handle field dependencies. The internal interpreter 
includes two parts: an internal interpreter table for each 
message in the message structure database to store the field 
dependency descriptions, and an internal Interpreter engine, 
running a simple and small stack-oriented language, to inter- 
pret these descriptions in real lime. Fig. 8 shows how the 
internal interpreter works in I his example. 



typedef struct 

{ 

unsigned int pos_octet; 
unsigned int pos_field; 
unsigned int map_id_ octet; 

unsigned int map id. field; 

unsigned int size; 
struct octet_struct 'octet; 
struct field_struct *field; 
) mhs msd__map_r; 



(al 



Mapb 



pos octet = 4 



pas field = 12 



map id octet = 1 
map id field = 0 



sue = 4 (bits) 
octet ■ <a pointer.} 
field = <a poinler.> 



(bl 
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Fig. 6. The mapping data struc- 
ture and an example, (a) The 
data structure for a map element, 
(bj Detail of map b in Fig. 5. 
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rule Noce_l { 

if (decode (Fieid_A) == 0) < 

return required; 
} else { 

return absent; 

} 

) 



typedef struct 
( 

connnands_e 
unsigned int 
unsigned int 
} mhs_mii_code_r; 



command ; 
valuel; 
value2 ; 



typedef enum 
{ 

required = 0, 
optional = 1, 
absent ■ 2 
) inclusion_type_e; 



Mil Table 




Command 






Comments 


push 






Index ol field A is 0. 


decode 


\ 






push 






0 is the criterion lor 
decode (Field A). 


eq 








push 


1 




Index ol block|1] 


push 


2 




Index ol block|21 


il 









Command 


Value 1 


Value 2 


Comments 


push 


0 




0 indicates required 



Command 


Value 1 


Value 2 


Comments 


push 


2 




2 indicates absent 



Push in 0. 



Pop out as the lield 
index Decode 
field(O) and push in 
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r 



Stack 



Fig. 8. An example for lite message handling system internal interpreter, (a) The MDL script, (b) The internal Interpreter data structure 
in the structure database, (v.) How the internal interpreter engine works. 



The MDL compiler converts the field dependency description 
from the MDL scripl shown in Fig. 8a into an internal inter- 
preter table indicated by Fig. 8b. In this case, the internal 
interpreter table consists ol" three blocks of internal inter- 
preter codes. Block(0] indicates how to evaluate the expres- 
sion in the if(expression| statement. Bloek|l] will be executed 
if the expression is due and block[2] will be executed Other- 
Wise. Each block contains a sequence of internal interpreter 
code. Block[0] contains multiple lines of internal interpreter 
code, while block| 1) and block[2] each contain only a single 
line. A line of internal interpreter code is defined by the struc- 
ture mhs_mii_code_r, which contains a command and its corre- 
sponding values if applicable. For example, the command 



push has only a single applicable value located in value I. 
The comment columns in each block give the explanation of 
the values used in each line. 

When the user changes the value of field A from 0 to 7, the 
internal interpreter engine is called to execute the codes 
Started from hloek[0| to evaluate the expression (decode(Field_A) 
— 0) as shown in Fig. 8c. The expression is not true in this 
case, since the value of field A is 7. and therefore b!ock[2] is 
executed. The absent value, which is defined as 2 in the data 
Structure inclusion_type_e, is returned as the last result of rule 
Note 1 (Fig. 4a). This makes the I'Dl' ediior appropriately 
change field (' from sensitive (required) lo insensitive 
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(absent) to meet the real-time interpreting requirement indi- 
cated by Fig. 7. 

Results 

L'sing the message handling system to add a new l"NI signal- 
ing protocol variant, the only work that a developer needs to 
do is to create a script file. In the script files the developer 
can easily describe the messages in the variant by using the 
protocol-specification-like language. The message handling 
system hooks the script file into the other parts of the BSTS 
automatically, uses the protocol-related knowledge to de- 
code and encode PDUs, and produces the desired results. 

In this section, the organization of the general message and 
one of its instances, the SETUP message, are used as examples 
to demonstrate the MDL script and the results provided by 
I he BSTS. The SETUP message is one of the most importaiii 
types of messages used in L'NI signaling since it carries all 
lite information for requesting a connection between two 



generalmessage Message 

{ 

octet 1 

octet 6 required { 

/* this octet is required by the 

specification */ 

field MeBsageType [8] MessageType = 5; 

/* MessageType is the variable name. 

[8] indicates the width is 8 bits 

and value 5 (SETUP) is the default 
value */ 

) 

octet 9 

) 

(a) 



Protocol Discriminator 



lenglh ol Call Reference Value 
0 0 0 linOdels) 
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Variable Length Information Elements as Required 



8 7 6 5 4 3 2 1 


Meaning 


000----- 


Call Establishment Messages 


0 0 0 1 0 


CALL PROCEEDING 


0 0 t 1 1 


CONNECT 




0 1111 


CONNECT ACKNOWLEDGE 1 


0 0 10 1 


SETUP 






Information Element 


Direction 


Type 


AAL Parameters 


Both 
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Both 
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Call Party Number 
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Octets 
1 



lei 

Fig. 9. General message organization and SETUP message specifies* 
lion. 2 (a) general message organization, (b) Predefined values for 
message types, (c) IE inclusion of the SETUP message. 



message SETUP 
{ 

id ObOOOOOlOl; 

/• binary identifier for SETUP message */ 

AALParam both optional; 

/* both directions are optional */ 

ATMCellRate both mandatory; 

/* both directions are mandatory */ 

BBearerCap both mandatory; 



CalledNum 
QosParam 



both mandatory; 
both mandatory; 



Ibl 

Fig. 10. MDL scripts, (a) Part of the MDL script for the general 
message organization, fb) Part of the MDL script for the SETUP 
message. 

user equipments. Fig. 9 shows parts of the protocol specifi- 
cation from L'NI 3.O. 2 Fig. 10 is the corresponding MDL 
script. 

Fig. 9a is the simplified definition of the general message 
organization. When the value of octet 6 is equal to 5 (binary 
value 00000101). the message type is defined as SETUP as 
illustrated in Fig. 9b. Fig. 9c depicts the IE inclusion for the 
SETUP message. The "Direction" column indicates whether 
an IE is used in the user-to-network direction, the network- 
to-user direction, or both. 

Fig. 10 shows two segments of the MDL script with C-style 
comments. Fig. 10a corresponds to the specification of the 
general message organization shown in Fig. 9a and Fig. 10b 
is the translation for Fig. 9b and Fig. 9c. 

Fig. 1 1 shows the user interface of a UNI signaling PDl T 
editor provided by the BSTS after it hooks into the message 
handling system. As defined by the specification shown in 
Fig. 9a and translated by the MDL script shown in Fig. 10a, 
a message includes the following major parts: 
Protocol discriminator 
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Fig. 11. 1'singthe PDTJ editor to construct a SETUP message. 



Call reference 
Message type 
Message length 

Information elements, as required. 

From the option menu in the Message Type item, the user can 
select any type message to edit. For the selected type, the 
Type text field automatically displays the corresponding digital 
value and the Information Elements list shows all the mandatory' 
IKs. In this example, the message has been Interpreted as a 
SETUP message (value 5) and four mandatory IKs have been 
listed according to the MDI, script in Fig. 10b. The user can 



also edit an IE by clicking the Edit button and add optional 
IEs by clicking the Append button. 

A PDl" is constructed by clicking the OK or Apply button. All 
the field values specified by the user are put in proper bit 
and byte positions according to the description of the MDI, 
script. 

Using the BSTS playback viewer, Fig. 12 shows the decoding 
result of the constructed message in this example. Fig. 13 
illustrates the hexadecimal format of the message. In Fig. 12. 
the window contents are in two pails: the message header, 
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j~ This octet defines the message type. 
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Fig. 13. The hexadecimal format of a message. 



which includes the first nine octets in the general message 
organization, and part of the first contained IE. the ATM 
user cell rate. Notice that the value of octet 6 in the message 
header is 00000101 in binary, which is interpreted as SETUP 
according to the MDL script shown in Fig. 10b. With this 
decoding result, the user can easily monitor the transaction 
between the user equipment and an ATM switch. 

Summary 

The message handling system provides a powerful tool for 
developers to handle many UNI signaling protocol variants 
and also many different types of messages and IEs used in 
each variant. The message handling system significantly 
improves the maintainability, extensibility, and flexibility of 
the UNI signaling test software. Because there are a lot of 
similarities between UNI signaling protocols and NNI/PNNI 
signaling protocols, it would be possible to reuse the message 
handling system were we to add NNI and PNNI signaling test 
software to the BSTS in the future. The message handling 
system enables HP to keep its competitive position in the 
signaling test software field, in which old protocols are con- 
stantly updated and new protocols are constantly emerging. 
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Object-Oriented Network Management 
Development 



As networks continue to proliferate, the need to develop and deploy 
network management applications has become a critical issue. Two 
software development tools are described that allow developers to create 
powerful network management-side applications quickly without 
necessarily having to be experts on network protocols. 

by Peter E. Mellquist and Thomas Murray 



The advantages or using an object-oriented design methodol- 
ogy and its associated productivity tools, such as C++,' are 
well entrenched and accepted in today's software engineer- 
ing practices. One of the areas where the object-oriented 
paradigm can bring a great deal of productivity is in the 
development of network management applications. 

This article describes two management -side' tools for net- 
work management development: SNMP++ and SNMPGen. 
SNMP++ is a set of C++ classes that provide Simple Network 
Management Protocol ( SNMP) services to a network manage- 
ment application developer- SNMP++ brings the object- 
oriented advantage to network management programming, 
and in doing so. makes it much easier to develop powerful, 
portable, and robust network management applications. 
SNMPCien is a technology that uses SNMP+ + to automati- 
cally generate C+ + code from the SNMP Management Infor- 
mation Ha.se (Mil?) definitions. Fig. 1 shows a conceptual 
view of these two tools. 

this is id contrasi to agent-side lechnrilogies The manager/agent model, which is used tor 
nelwort management, is analogous to the client/server model lor distributed applications 
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SNMP++ was developed by Hewlett-Packard and is a freely 
available open specification. Anyone can obtain the specifi- 
cation document on the World-Wide Web at \'RLhllp://mse- 
gardefcexterriaLhp.a/m/snmp++. 

Benefits of SNMP++ 

Various SNMP application programming interfaces (APIs) 
exist that allow the creation of network management appli- 
cations. The majority of these APIs provide a large library 
of functions that require the programmer to be familiar with 
the details of SNMP and SNMP resource management. Most 
of these APIs are platform-specific, resulting in SNMP code 
that is specific to a particular operating system or network 
operating system platform, and thus, not portable. 

Application development using a standard set of C++ 
classes for network imuiagemenl provides many benefits 
including ease of use. safety, portability, and extensibility. 

Ease of Use. SNMP++ is designed so that application pro- 
grammers do not need to be concerned with low-level SNMP 
mechanisms. An object-oriented approach to SNMP encapsu- 
lates and hides the internal mechanisms of SNMP. SNMP+ + 
provides the following ease-of-use features: 
It provides an interface in which the user does not have to 
be an SNMP or C++ expert to use its features, For the most 
part C pointers do not exist in SNMP++. resulting in a simple 
and straight forward API. 

It provides easy migration to SNMP version 2. :> - 1 ( )ne of the 
major goals of SNMP++ was to develop an API that would 
be scalable to SNMP version 2 with minimal impact on code. 
It preserves the flexibility of lower-level SNMP programming. 
A user may want to bypass the object-oriented approach and 
code directly to low-level SNMP calls. 
It encourages programmers to use the full power of C++. 

Programming Safety. Most SNMP APIs require the program- 
mer to manage a variety of resources, such as memory and 
sockets. Improper allocation or deallocation of these re- 
sources can result in corrupted or lost memory. SNMP++ 



Fig. I. A conceptual view of (a) SN.MP+* and lb) SNMI'llcn. 
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SNMP 



SNMP (Simple Network Management Protocol) is by far the most popular 
system and network management protocol. 1 More devices and systems 
are managed with SNMP than any other management protocol. This is 
largely because SNMP is quite small and inexpensive to deploy, meaning 
that it can be implemented in devices with minimal memory and CPU 
resources. This is in contrast to the Open Systems Interconnect (OSI) 
management protocols which are complex and more expensive to deploy 

SNMP was developed to provide a basic, easy-to-implement network 
management tool for the Transport Control Protocol/Internet Protocol 
(TCP/IP) suite of protocols. This includes a framework of operation and 
a representation of management information within the framework. The 
Structure of Management Information (SMI) specification provides the 
definition of Management Information Bases (MIBs). 2 MIBs are analo- 
gous to database schemas. MIB definitions are the basis of interaction 
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between a managed entity and a managing entity (see Fig. 1}. The 
managed entity, called an agent, includes one or more MIBs that define 
the managed information This includes a standard set of management 
information which is part of the SNMP framework, and a set of vendor- 
specific management information which allows a vendor to instrument 
specific parts of a device or process. The ability to define custom MIBs 
allows SNMP management to be extended to meet the needs of a ven- 
dor's device. The managing entity, or manager, is responsible for making 
requests to the agent for management information and handling re- 
sponses from the agent. 
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manages these resources automatically. For programming 
safety SNMP++ addresses the following areas: 

• Safe management of SNMP resources. This includes SNMP 
structures, sessions, and transport layer management. 
SNMP++ classes are designed as abstract data types provid- 
ing data hiding and the ability for public member functions 
to inspect or modify hidden instance variables. 

• Built-in error checking and automatic timeout and retry. 
SNMP++ users do not have to be concerned with providing 
reliability for an unreliable SNMP transport mechanism. A 
variety of communications errors can occur, including lost 
datagrams, duplicated datagrams, and reordered datagrams. 
SNMP++ addresses each of these possible error conditions 
and provides the user with transparent reliability. 

Portability. Another major goal of SNMP++ was to provide a 
portable API across a variety of operating systems, network 
operating systems, and network management platforms (see 
Fig. 2). Since the internal mechanisms of SNMP++ are hid- 
den, the public interface remains the same across any plat- 
form. A programmer who codes to SNMP++ does not have 
Co make changes to move the program to another platform. 
Another issue in the area of portability is the ability to run 
across a variety of protocols. SNMP++ currently operates 
over tiie Internet Protocol (IP) or Internet Packet Exchange 
(IPX) protocols. 



Extensibility. Extensibility is not a binary function, but rather 
one of degree. Extensions to SNMP++ include supporting 
new operating systems, network operating systems, network 
management platforms, protocols, SNMP version 2, and other 
new features. Through C++ class derivation, SNMP++ users 
can inherit what they like and overload what they wish to 
redefine. 

Overloading SNMP++ Base Classes. The application program- 
mer can subclass the base SNMP++ classes to provide spe- 
cialized behavior and attributes. The base classes of SNMP++ 
are meant to be generic and do not contain any vendor- 
specific data structures or behavior. New attributes can be 
easily added through C++ subclassing and virtual member 
function redefinition. 

Example 

Fig. 3 shows an example of the power and simplicity of 
SNMP++. This example obtains a system descriptor object 
from an SNMP Management Information Base (MIB) from 
the specified managed SNMP agent. Included is all the code 
needed to create an SNMP++ session, get the system des- 
criptor, and print it out. Retries and timeouts are managed 
automatically. 
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Fig. 2. An illustration of the different operating systems, network- 
operating systems, anci network management platforms to which 
network management applications based on SNMP+t can be ported. 

The SNMP++ calls in this example total only ten lines of 
code. A CTarget object is created using the IP address of the 
ageni. A variable binding object (Vb) is then created using 
the object identifier of the MIB object to retrieve the system 
descriptor (SysDescr). The Vb object is then attached to a 
PDU (protocol data unit) object. An SNMP object is used to 



invoke an SNMP Get request. Once retrieved, the response 
message is printed out. All error handling code is included. 

SNMPGen: Machine-Generated SNMP Code 

Traditional SNMP development is fairly low-level and redun- 
dant. SNMP+- 1 programming requires a thorough understand- 
ing of the MIB definition used and the ability to perform 
manual fabrication of each PDU that is sent and received. 
When dealing with large amounts of MIB data, manual fab- 
rication becomes tedious and somewhat prone to errors. 

Consider, for example, traversing through a column in a MIB 
table. In SNMP this is performed by loading the object ID 
of the table column into a PDU and then performing SNMP 
GETNEXT operations on the table column until the object ID 
of the variable returned no longer matches the object ID of 
the table column. Although SNMP++ makes a task like this a 
little simpler for programmers, there still remains some 
complexity. 

SNMPGen is a set of classes that were created to go beyond 
the limitations of traditional SNMP development and to fur- 
ther reduce the complexity of developing network manage- 
ment tasks such as the table traversal task mentioned above. 
These classes allow a higher level of abstraction over SNMP. 
Rather than deal at the level of PDUs and ASN. 1 (Abstract 
Syntax Notation One) syntaxes, as is typical in SNMP devel- 
opment. SNMPGen offers an abstraction of MTBs and MIB 
objects. The resulting framework allows the user to write 
code focused on the manipulation of MIB objects with little 
concern about the underlying SNMP access details. Addition- 
ally, because SNMPGen is based on SNMP++, it can offer all 
the same advantages as SNMP++. 

SNMPGen Classes. A MIB contains managed objects thai are 
represented as tables, scalars, and groups. Each managed 



#include "snmp-pp.h" 

tdefine SYSDESCR "1.3.6.1.2.1.1.1.0"// Object ID for System Descriptor 

void get_system_descriptor ( ) 
{ 

int status // return status 

CTarget ctarget ( ( IPAddress ) "10.4.8.5"); // SNMP++ vl target 
Vb vb( SYSDESCR); // SNMP++ Variable Binding Object 

Pdu pdu; // SNMP++ PDU 

// Construct an SNMP++ SNMP Object 

Snmp snmp( status); // Create an SNMP++ session 

if (status != SNMP_CLASS_SUCCESS ) { // check creation status 

cout << snmp.errorf status); // if fail, print error string 

return; ) 

// Invoke an SNMP++ Get 

pdu += vb; //add the variable binding to the PDU 

if ( (status = snmp. get! pdu, vltarget)) != SNMP_CLASS_SUCCESS) 

cout << snmp.errorf status); 
else { 

pdu.get_vb( vb,0); // extract the variable binding from PDU 

// print out the value 
cout << "System Descriptor = "<< vb.get_printable_value( ) ; } 



Fig. 3. A simple SNMP+-I- example. 
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Object 
Identifier 



Name 



1.3.6.1.2.1.1.1.0 sysOescr N 
1.3.6.1.2.1.1.2.0 sysObjeclID 
1.3.6.1.2.1.1.3.0 sysUpTime 
1.3.6 1.2 1 1 4.0 sysConlact 
1.3.6.1.2.1.1.5.0 sysName 



Components 



sysDescr OBJECT-TYPE 

SYNTAX DisplaySlring (SIZE 10 2551) 

MAX-ACCESS read-only 

STATUS current 

DESCRIPTION 

"A textual description of the entity. This value 
should include the full name and version identifi- 
cation of the system's hardware type, software 
operating system, and networking software." 

::=( system 1 ) 



object has a name and a unique object identifier and attrib- 
utes such as syntax and description (see Fig. 4 ). Tables 
contain one or more indexes (pointers to other objects) and 
zero or more managed objects. A scalar represents a single 
managed object. Groups can contain objects, tables, and 
other groups. 

A MIB also defines the relationships among managed ob- 
jects. Fig. 5 shows the relationships among SNMPGen 
classes. 

SNMPGen Operations. A set ol" powerful operations (methods) 
are available for the various SNMPGen classes. These opera- 
tions allow the manipulation of objects, columns in tables, 
entire tables, and groups. Many of the operations are con- 
ceptually the same for all of the classes. As a result it takes 
the same amount of code to get an entire MIB as it does to 
get a single MIB object This dramatically reduces the coding 
effort for writing management applications. 

All of the classes have operations to obtain values from 
an SNMP agent (by performing SNMP GET operations) and 
operations to send values to an SNMP agent (with SNMP SET 
operations). Additionally, the classes can have specialized 
operations that help with manipulating specific types of 
objects. For example, the CSNMPManagedTable class has 
operations for manipulating rows and columns in a table. 

All SNMP considerations are hidden beneath the SNMPGen 
operations. However, many of the SNMP++ classes are used 
to manipulate the basic syntax of MIB variables. While nearly 
eliminating the requirement for detailed SNMP knowledge, 
this approach provides all the power and flexibility offered 
by SNMP++. 



Fig. 4. Identifiers for the contents 
of a Mil) (Management Information 
Base) showing object identifiers 
and the object names. Also shown 
is the MIB definition of a managed 
object. 

In addition to the operations provided, the classes can be 
extended to perform any tasks specialized for a particular 
application. Tltis can be done simply by deriving new classes 
from the SNMPGen classes anil defining the operations for 
the new classes. 

Mibmgr. SNMP' len classes allow modeling of MIB objects, bill 
they do not give the objects context. For example, instead of 
having a generic group class, a developer mighl want a class 
that represents a group called MIBII' or maybe a system class 
that represents the system group. A tool called mibmgr was 
developed to derive classes representing these kinds of spe- 
cific MIB objects from the classes provided by SNMPGen. 
The result is a class definition for every object in a MIB. 
Developers then use these classes in SNMP management 
applications. 

MIB documents tend to be long and somewhat difficult to 
read. The mibmgr tool was first developed to help the read- 
ability of MIBs by displaying them in a simple form. For 
example, MIBII would be reported as shown in Fig. 6. 

This output was created by parsing the output of a MIB com- 
piler tool called MOSY (Managed Object Syntax Compiler 
yacc-Based), which comes wilh the SNMP subagent develop- 
ment environment licensed from SNMP Research and main- 
tained by HP's Network System Management Division. MOSY 
takes a MIB definition as input and produces a compact, 
easily parsable output file. This output file is then used as 
input to the mibmgr tool (see Fig. 7). 

MIBII is the second generation ot standards 'or managed SNMP Dejects as defined bv the 
Internet Engineering Task force. 



CSNMPManagedGroup I CSNMPOb|CclTypo 
of I 

Tabular or 



CSNMPManagedTable 
IA Collection ol Objects 
lhat Make up a Table) 



CSNMPTableObiectType 
(Derived Tabular Object) 



CSNMPScalarObjectTypc 
(Derived Scalar Object! 



Fig. 5. The relationships between 
SNMPGen classes. 
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Object Type 


■QBE 




[1] mib_2 




[1] 9ysteio 




I1J sysDescr 


DisplayString 


[2] syBObjectID 


Object ID 


[3] aysnpTime 


TimeTicks 


(4} sysCoatact 


DisplayString 


[5] sysName 


DisplayString 


[6] sysLocation 


DisplayString 


[7] sysServices 


INTEGER 



[2] interfaces 

tl] lfNumber 
, [2] ifTable 
tl] IfEntry 
[1] if Index 
[2] ifDesc 
[3] ifType 



INTEGER 
Aggregate 
INDEX "iflndex" 
INTEGER 

DisplayStringr 
INTEGER 



[...] 



Fig. 6. TIip output produced by the mibmgr for the example M1BII. 

We quickly realized thai this same information could be 
used ill conjunction with the SNMPGen classes to produce 
C++ header files for SNMP management applications (see 
Fig. 8). Mibmgr was augmented to produce class definitions 
for each variable in a MIB. This gives context to the 
SNMPGen classes by completely defining the object ID 
space as well as the entire Mffl hierarchy. 

The following code shows an example of a machine-gener- 
ated scalar object. (Note that mibmgr creates a C++ header 
file for the entire MIB and only the portion relevant to this 
example is shown.) 

#define SsysDescr "1.3.6.1.2.1.1.1" 
class CsysDescr: public CSNMPScalarObjectTypef 
public : 
CsysDescr ( ) : 

CSNMPScalarObjectType (SsysDescr, 
sNMP_SYNTAX_OCTETS) { ) ; 

}; 

Similarly, a machine-generated group would appear as 
follows: 

class Csystem: public CSNMPManagedGroupt 
public : 
Csystemt ) : 

CSNMPManagedGroup(Ssystem) { 
AddObject (fcsysDescr ) ; 
AddObject (SsysObjectID) ; 
AddObject (SsysUpTirae) ; 

); 

CsysDescr sysDescr; 
CsysObjectID sysObjectID; 
CsysUpTime sysUpTime; . . . 

); 



This scheme has several benefits. Since the class definitions 
are produced in C++ header files, any changes to the MIB 
have little effect on management applications. The C++ 
header files also allow easy extensions to the classes if 
needed. Machine generation of class objects greatly reduces 
the possibility of coding errors and typing mistakes. 
SNMPGen combined with the machine-generated classes 
allows the developer to deal only with MIB objects and hier- 
archies- The use of the machine-generated classes essen- 
tially makes a MIB definition the development specification. 

Example. Fig. 9 shows a minimal program that takes a host- 
name as an argument and returns the system descriptor. The 
example assumes that the mibmgr tool was used to create the 
header file mib2objs.h from the MTBII specification. A sample 
output from the example in Fig. 9 is as follows: 

S minite8t hpisrhx 
SysDescr = HP-UX hpisrhx A. 09. 01 E 9000/720 
$ ininitest hpinddh 

Failed to issue SNMP get: Timed Out While 
Waiting for Response Pdu 

The first case worked, and in the second case, no SNMP 
agent was running on system hpinddh, so the request timed 
out. The key points to note from this example are: 

» Header files include the SNMP++. SNMPGen and its 
machine-generated files (mib2objs.h includes the SNMP++ 
and SNMPGen headers), and the standard C++ I/O library. 

» SNMP++ target and SNMP classes are required to perform 
MIB access. 

• The machine-generated classes produced by SNMPGen 
have the same name as defined in the MIB specification, 
with a C prepended (e.g., CsysDescr). 

• Retrieving the object is done using just one operation 
against the object, which in this case is sysDescr.SNMPGetl), 
The member function SNMPGetO is inherited from the base 
class CSNMPObjecfType. 

• Accessing the retrieved data is also done as a single opera- 
tion against the object, which in this case is sysDescrGetPrinta- 
bleValuel). The member function GetPrintableValue is inherited 
from the base class CSNMPObjectType. 

Practical Uses of SNMPGen 

SNMPGen classes combined with the machine-generated 
classes make it possible for management applications to be 
developed quickly and efficiently. Engineers writing manage- 
ment applications are frequently not networking experts. 
The classes allow SNMP management without requiring 
SNMP expertise. Additionally, when correctly used, the 
classes can reduce complexity and redundancy in manage- 
ment code. 

Several managemenl projects have been based on the SNMP- 
Gen classes. The HP ClusterView product, which is used to 
monitor MC/ServiceGuard high-availability dasters, uses the 
SNMPGen classes to collect cluster information. The tool 
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Fig. 7. The processes involved in creating a MIB report 
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((include <iostream.h> 

((include "mib2objs.h" 

main (int argc, char **argv) { 

int status ; 

CsysDescr sysDescr; 

int status ; 

CsysDeBcr BysDescr; 



// result of operation 

// SNMPGen created class from mib2objs.h 

// result of operation 

// SNMPGen created class from mib2objs.h 



if (argc I- 2 ) { 
cerr << "Usage: 
return (1); } 

IPAddress destination (argv [1J ) ; 
if ( [destination. valid) )) { 



<< argv[0] << " <hostname>" << endl; 



// use command line arg as destination 
// verify we resolved this address 



cerr << "Bad hostname: 
return ( 1 ) ; } 



<< argv[l] << endl; 



CTarget target (destination) ; 
Snmp my_session ( status ) ; 



// Create an SNMP++ Target 

// Create an SNMP++ Snmp object 



if (status) ( // Ensure the Snmp object is OK 

cerr << "Failed to create SNMP Session" << status << endl; 
return ( 1 ) ; } 

status = sysDescr .SNMPGet (my_ session, target);// Access the MIB-II object 
if (status) { // Check for success 

cerr << "Failed to issue SNMP get: " << my_session. error (status) << 

endl ; 

return(l); ) 



cout << "SysDescr = " << sysDescr .GetPrintableValue( ) << endl; // display 
return (0) ; 



Fig. 9. A program that uses SNMPGen classes to find a particular host (using a hostname parameter) and returning the MIBII system 
descriptor. 



was designed and written by engineers with very little net- 
working, SNMP, or even C++ experience. The Web page 
http://hpgsslhc.cup.hp.com/ClusterView explains this product. 

Additional applications include MIB browsing tools and MIB 
monitoring tools. Since the classes have MIB information 
embedded within them, they can provide good representa- 
tions of MIB Structures. Additionally, each class can answer 
questions about MIB object syntax, descriptions, and so on. 

Alternatives 

Development of network management applications that 
need to manage network devices or network systems is not 
an easy task. Often these devices operate in heterogeneous 
environments where one or more operating systems may be 
present. This can place additional requirements on a manage- 
ment application. Specifically, an application might need to 
run on more than one operating system, like the UNIX oper- 
ating system and Microsoft "'Windows. 

Before SNMP++. the ability to use a production-quality set 
of object-oriented SNMP tools did not exist. Without one 
common set of SNMP libraries, experts are required for 
each operating system platform and each different SNMP 
library. Not only does this require additional engineers for 
development, but testing and maintenance are also adversely 
affected. SNMP++ allows reused code across any platform 
that supports ANSI/ISO C++ and supports Berkeley sockets. 



including all derivatives of the UNIX operating system and 
Microsoft Windows. Clearly, having to code to and Support 
multiple SNMP APIs is not a viable alternative. 

Conclusion 

Creating the SNMPGen classes and the mibmgr tool was not a 
traditional project. It was done as a side job and encountered 
many obstacles, the foremost being lack of time and re- 
sources. The project pioneered the use of C++ development 
in the division and exposed many issues with the traditional 
development and integration model. For example, many of 
the tools had never been used with C++ code before. 

The use of SNMP++. SNMPGen. and the machine-generated 
C++ header Files demonstrates the power of object-ori- 
ented design and programming. The object-oriented para- 
digm allows complete freedom for developers. New attrib- 
utes can be assigned to the class definitions and the classes 
are fully extensible. The use of these classes greatly reduces 
development effort. Engineers do not need to know net- 
work programming or leant about SNMP to create SNMP- 
based management applications. 

As more and more networked devices and distributed sys- 
tems are used in todays communication-intensive world, 
SNMP will play a major role in remote management. 
SNMP++, SNMPGen, and the classes created with the mibmgr 
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tool will help developers meet the need for network and 
systems management by enabling rapid development. 
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Design of an Enhanced Vector 
Network Analyzer 



A liquid crystal display (LCD) reduces size and weight and has a larger 
viewing area. TRL (Thru-Reflect-Line) calibration allows measurement 
of components that do not have coaxial connectors. New software 
algorithms achieve faster acquisition and frequency tuning of the 
synthesized source to give faster updates of the measurement data. 

by Frank K. David, Frederic W. Woodhull II, Richard R. Barg, Joel P. Dunsmore, 
Douglas C. Bender, Barry A. Brown, and Stanley E. Jaffe 



The development of a new measurement instrument always 
involves technical challenges. This is also true for the 
enhancement of an existing family of instruments with the 
added challenges of getting the enhancements to market in 
exactly the right form and in a timely manner. 

The measurement instrument called a IKCtOT network ana- 
lyzer ( see Fig. 1 ) is the integrat ion of a tunable source, a 
multichannel receiver, and a test set for signal routing and 
separation, all under the control of an embedded CP!'. The 
block diagram in Fig. 2 shows the interconnection of these 
components, which for the family of instruments described 
here are all integrated into one package. 

The purpose of a vector network analyzer is to measure the 
magnitude and phase of the reflection and transmission 
characteristics of a microwave component (historically 
called a microwave nelwork) as functions of frequency. The 
component is inserted between the test ports, the test signal 
is rapidly timed over a span of frequency, and portions of this 
signal are reflected from and transmitted through the com- 
ponent. These reflected and transmitted signals are ratioed 
with a portion of the test signal that has been tapped off into 



the reference channel to display the component's reflection 
and transmission characteristics as functions of frequency. 
This information is needed by designers to know if their 
components will work properly when inserted into a micro- 
wave system. 

The accuracy of the measurement, which depends on sepa- 
rating the component's characteristics from the characteris- 
tics of the instrument, is greatly improved by the ratioing 
process. The accuracy is further enhanced by a calibration 
process that measures known standards (calibration compo- 
nents) to characterize and mathematically remove the re- 
maining systematic errors in the instrument. 

The components that are measured with a vector network 
analyzer are used in a wide variety of commercial and mili- 
tary products: cellular phones, broadcast and cable TV', long- 
distance telephone transmission, satellite communications, 
microwave ovens, airplane radar, missile guidance, and so 
on. Each of these applications uses dozens of components 
that are developed and produced with the aid of a vector 
network analyzer. Thus, the accuracy and measurement 




Fig. 1. HP 8720D vector network 
analyzer. 
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Fig. 2. HP 8720D vector network 
analyzer block diagram. Two refer- 
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speed of the vector network analyzer will have an impact on 
the performance and cost of these products. 

The family of IIP vector network analyzers known as the IIP 
872(1 family is composed of three members that differ in the 
frequency range over which they can make their measure- 
ments. The HP 8719D, 8720D, and 8722D cover the frequency 
ranges from 50 MHz to 13.5 GHz, 50 MHz to 20 GHz. and 
50 MHz to 40 GHz respectively. The family made its first 
appearance early in 1988 with the introduction of the HP 
8720A and has evolved over the years to the introduction 
of the D models in May 1900. 

The evolution or the HP 8720 family from the A models to 
the C models was to improve the instruments' html ware 
performance. The recent evolution from the C to I he D 
models was to respond quickly and directly to a change in 
users' needs. More and more the instrument was being used 
in a production measurement role as opposed to the pre- 
vious R&D role. This change brought with it a need for more 
measurement speed and expanded I/O capability to allow 
easier and better integration into a larger production lesi 
system. Also, the industry' trend of higher levels of integra- 
tion of components made it necessary to measure compo- 
nents that do not have coaxial connectors — for example, 
a probe station measuring amplifier chips on an IC wafer. 
We wanted CO respond with something that not only met the 
new measurement needs, but also had a new look: a state- 
of-the-art display technology and a smaller, lighter package. 

The list of improvements we could have made was long, but 
we could only choose a few because we wanted to respond 
quickly. This required careful selection and sorting of the 
customer inputs that come to us directly or through our field 
sales force. One difficulty in soiling these inputs was the 
fad that one fundamental need can manifest itself in several 
different ways— for example, after thorough investigation, 
an issue of instrument calibration turned into an issue of 
measurement speed. Another difficulty was sorting the 
unique but strongly stated needs from the broadbased but 
quietly slated needs. 



Once the sorting and prioritizing were done, we focused on 
the most important changes that also allowed us the most 
leverage from existing designs. When this was done, we 
proceeded to improve the HP 8720 family with these design 
changes: 

• A liquid crystal display (LCD ) gives size and weight reduc- 
tions plus a larger viewing area. These are important features 
in a production environment. 

• En the HP 8720D Option 400 models, hardware and software 
additions implement a calibration technique called TRL 
(Thni-Refleet-Line). which uses four receivers: two reference 
and two measurement. This allows accurate measurement 
of components that do not have coaxial connectors. Imple- 
menting TRL required some careful multiplexing of the 
receivers' outputs because the instrument is limited to three 
data channels. 

• New software algorithms achieve faster acquisition and 
frequency tuning of the synthesized source to give faster 
updates of the measurement data. 

Liquid Crystal Display Electrical Design 

There were many advantages to be gained by converting the 
instrument's display from the traditional color CRT to a color 
LCD. as can be seen in Table I. In particular, note the signifi- 
cant decrease in volume, weight, and power consumption 
and the increase in viewing area. However, as beneficial as 
this conversion is. implementing it in a short time presented 
some significant challenges. 

Originally, the display interface was designed to drive a 
monochromatic, digital, vector CRT display with a state 
machine controlling stroke and font generators. The state 
machine executed four basic commands (set condition, plot, 
graph, character) which were stored in a display list. When 
the monochrome display was later replaced by a color raster 
display, the same interface was used, making the transition 
to color more or less a drop-in replacement ( with necessary 
changes to control the color attributes of lines and text). 
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Table I 

LCD versus CRT Display Characteristics 









LCD 




7.5-in CRT 


8.4-in LCD 


Advantage 


Volume 


9745 em 3 


240 cm 3 


-97.5% 


Weight 


5 kg 


0.33 kg 


-93.4% 


Power Consumption 


35W 


3W 


-91.4% 


Viewing Area 


170.4 cm 2 


221.5 cm 2 


+ 30.0% 


Horizontal Size 


15.6 cm 


17.1 cm 




Vertical Size 


10.9 cm 


12.96 cm 




Aspect Ratio 


0.7 


0.75 




Pixel Frequency 


35.9 MHz 


25 MHz 




Horizontal Frequency 


25.46 kHz 


31.41 kHz 




Vertical Frequency 


59.9 Hz 


59.8 Hz 




Vertical Resolution 


400 


480 




Horizontal Resolution 


1024 


640 





With the color raster CRT. the display interface was built 
around a graphics system processor, which emulates the 
state machine's stroke and font generators. Supporting 
blocks for the graphics system processor include DRAM to 
store the graphics system processor code and display list. 
VRAM to store the pixel data for the raster display frame 
and fonts, and a color palette, which uses the pixel data to 
address a color lookup table which then drives a triple DAG 
to generate the analog RGB outputs to drive the CRT. The 
block diagram for this is shown in Fig. 3. The pixel data is 
4 bits, which allows 16 simultaneous colors to be displayed. 
Lookup table registers have 12 bits, which allows selecting 
one of 4096 c olors. 

Changing from the color CRT to a color LCD in a short lime 
did not allow creation of a totally new display interface. The 
focus was to address each of the differences listed in Table I 
This resulted in changes of display resolution, timing, aspect 
ratio, data format, power requirements, and test patterns. The 
resolution change required changing the x-axis and y-axis 
scaling in the graphics system processor plot and graph rou- 
tines and the size of the bitmapped fonts. The display timing 
change required modifying the values of the graphics system 
processor timing register value. The change in aspect ratio 



required changing the host code that generates circles to 
maintain their roundness. This change affected the aspect 
ratio of the printer output, which required a modification in 
the graphics system processor code controlling the horizon- 
tal scale factor. The data format change required changing 
from the CRT's analog RGB data to digital data. The color 
palette did not provide access to the digital data, so it was 
replaced with a static RAM for the color lookup table and a 
triple \ideo DAC, whose output is used for driving an exter- 
nal VGA monitor, as shown in Fig. 4. The power supply had 
to be changed to the 3.3Vdc required to drive the display, 
but more important, this voltage had to be withheld from the 
display until the video synchronization signals were present 
or the liquid crystal structure would be damaged. The test 
patterns were changed to allow for testing some of the 
unique characteristics of an LCD: a black screen checks for 
pixels that may be stuck on, primary color bars help evaluate 
the viewing angle, and 16 color bars help evaluate each of 
the LCD's color bits for troubleshooting the digital interface. 

The LCD, which measures 8.4 inches across the diagonal of 
the active viewing area, was originally designed for use in a 
laptop computer. Using it in an instrument required several 
modifications to the display's viewing characteristics to im- 
prove general brightness and grayscale contrast reversal 
( light areas turn dark and dark areas turn light) over a wide 
viewing angle. 

The laptop application optimized its brightness and contrast 
for a relatively narrow viewing angle (approximately ± 20 
degrees off the perpendicular in either the horizontal or the 
vertical direction). Instrument application requires a wider 
angle, typically ± 45 degrees. An instrument display needs 
to have this wider viewing angle to be free of grayscale con- 
trast reversal. The solution to these problems was to modify 
the light polarizing and diffusing films that coat the display, 
and to avoid using colors that are not fully saturated. This 
restriction on the level of color saturation means that to 
achieve maximum viewing angle, an LCD that is capable of 
4096 colors is limited to a total of eight. This certainly limits 
the color flexibility, but it is enough for our four-trace display. 

One advantage of converting from a CRT to an LCD is that 
LCDs don't have convergence, focus, geometric distortion, 
and color purity concerns. This produces a visual image that 
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is sharper and more consistent in color across the entire 
screen. 

Liquid Crystal Display Mechanical Design 

The design of the front panel for the HP 8720D vector net- 
work analyzer incorporates many features that solve prob- 
lems associated with the display, including aligning the dis- 
play with the panel, preventing leakage of electromagnetic 
fields from the display area, ease of assembly, and space 
saving. 

In past instruments with a CRT display, the display and front 
panel inherentiy liad poor alignment with each other because 
of the lack of a common mechanical reference. The LCD 
design, shown in Fig. 5, solves this problem by integrating 
into the front panel the display bezel and mounting posts. 
This allows self-alignment of the display (0 the bezel. The 
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Fig. 4. LCD graphics system. 

mounting posts also control the spacing between the LCD 
and the panel to capture the gaskets and the protective anti- 
reflection glass properly. 

Electromagnetic interference (EMI) from the electrical cir- 
cuitry in the LCD is a problem because it cannot be shielded 
by the instrument's sheet-metal enclosure. To solve this, a 
thin, transparent layer of metal was added to the surface 
of the glass, and this layer is electrically connected to the 
metallic front panel by conductive silver strips along all the 
edges of the glass. To ensure low-resistance, reliable contact, 
special Spring clips attached to the meialized edges of the 
glass make pressure contact to the front-panel casting. In 
addition lo the electrical connection, the clips also locate 
and hold the glass in the panel. The net effect is to produce 
a continuous metal shield over the display without notice- 
ably affecting the display's viewabilily. To provide additional 
shielding for the LCD. a special conductive gasket was added 
to the LCD's metal frame to connect this frame to the edge 
meiallizaiion on the glass. This gasket also hides the LCD's 
metal frame, which would have been visible in the wide bezel 
opening necessary to yield the maximum viewing angle of 
the display's active area (see Fig. 6). 

The key to the ease of assembly is the LCD backing plate, 
which has several functions. It holds the LCD to the bezel 
with five pressure tabs, it provides mounting for the circuit 
board I hat drives the display's backlight, it captures the flex 
circuit that connects the LCD and backlight driver to the 
graphics processor, and it allows access to the backlight For 
easy replacement. The assembly of the LCD to the front panel 
is a simple layering of parts ;ill from one side, and the layered 
parts are all held in place by only three screws. This gives a 
nigged and very compact arrangement. The full depth of the 
assembly from the front bezel to the back of the mounting 
plate is less than 21 mm (compared to 317 mm for a stan- 
dard CRT). This drastic reduction in the depth of the display 
allowed a 20% volume reduction over previous models anil 
reduced Ihe depth of the instrument, which is very important 
because it gives the user more working room in front of Ihe 
instrument. 



Fig. 5. The LCD bezel and mounting posts are Integrated into the 
ihnnunani from panel 
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Fig. 6. To reduce electromagnetic interference, the display has a 
transparent metal layer on Us glass surface, a conductive gasket on 
its frame, and spring clips and silver strips to connect the glass to 
the frame. 

Three-Receiver Analyzer 

To understand the consequences of adding a fourth receiver, 
it is helpful to review the characteristics of the standard 
three-receiver version of the network analyzer. The receiver 
of the network analyzer is synthesized, and the source is 
phase-locked to the receiver through the reference receiver 
path, as shown in Fig. 7. In the standard instrument, the ref- 
erence receiver path is obtained by splitting some of the 
signal from the main signal path. This same splitting point is 
used for the automatic level control (ALC) detection point. 
The main signal path continues to the test set transfer 
switch, which directs the signal through one of the test port 
couplers for forward or reverse signal path measurements. 
The isolation of the switch in the off path creates a cross- 
talk term, limiting the dynamic range of the transmission 
measurements. 



The effects of the system configuration on measurements 
can be tmderstood in the context of system specifications as 
seen from the test ports: power source match, ratio source 
match, load match, test port output power, and test port 
power flatness. 

A key specification for network analyzers is source match, 
which is a measure of how close the test port impedance 
is to an ideal 50-ohm real source impedance. If the driving 
point impedance of a test port is not equal to the ideal refer- 
ence impedance, measurement errors such as ripples on 
transmission measurements will become evident. In fact, 
there are two different source match terms: actual source 
match, which might be referred to as the power source 
mulch, and the source match in a ratio measurement, 
which is referred to as ratio source mulch. Good power 
source match is important for test port power accuracy, 
while good ratio source match contributes to accurate 
measurement of s-parameters. The ratio source match is 
created by taking the ratio of a test signal to the reference 
signal. This provides an improvement over the power source 
match. 1 

For power source match, a sample of the of the source sig- 
nal is routed to a detector, and the level of the source is con- 
trolled to keep the detected signal constant. This has the 
effect of making the split point a virtual voltage source, thus 
having a zero impedance. The series 50-ohm impedance in 
the main path now sets the power source match. The power 
source match in this case is still not exactly 50 ohms because 
some signal that does not come from the \irtual source node 
may couple to the detector as a result of high-frequency cou- 
pling from the main arm output to the detected output. Also, 
any physical resistor will have some reactive component, 
such as series inductance or shunt capacitance. Although 
minute, these small parasitic elements have substantial 
effects at high microwave frequencies. For example, 0. 1 pF 
of shunt capacitance has a reactance of 50 ohms at 30 GHz. 

Ratio source match is very similar to power source match, 
in that the sampling of the reference signal makes the node 
of the sampling point appear to be a virtual source, and the 
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series resistance in the main arm sets the ratio source match. 
As shown in Fig. 7. the same node is used for both power 
leveling (ALC) and ratio splitting, and the same series resis- 
tor is used for setting both kinds of source match. Because 
of differing parasitic coupling factors from the main arm to 
the detector path and from the main arm to the ratio path, 
the power source match and the ratio source match will not 
be identical, especially at high frequencies. For most, mea- 
surements, the main concent is for good ratio source match. 

The load match of a network analyzer is the term used for 
the impedance of the test port that is not the active source. 
It is easy to see from the block diagram that there is a differ- 
ent path for the load impedance in this state. Therefore, the 
source match and the load match of a particular port will not 
be t he same. 

The insertion loss of the switch reduces the test port signal. 
The total loss of the switch limits the maximum power avail- 
able, and the frequency response, or flatness, of the switch 
determines the test port power flatness. The maximum test 
port power is determined mainly by the switch loss, but also 
by the maximum power available from the source amplifier, 
the loss in the reference and detection splitters, and the loss 
in the test port coupler. Because the test port switch is posi- 
tioned between the reference channel and the test channel, 
any switch repeatability errors, or drift, will be errors in the 
measurements. Power flatness is determined by how well 
the detector sensitivity remains constant with frequency. 
However, because the loss from the power leveling node to 
the front panel test port increases with frequency, the detec- 
tor path is designed so that the source power increases with 
frequency, thus compensating for the loss to the test port. 

Incorporating a Fourth Receiver 

Fur the four-receiver version (Option 400), the test port 
switch needs to come before the reference channel power 
splitter, thereby creating two reference signals and two test 
port signals, one for the forward direction and another for 



the reverse direction (Fig. 8). In this way, the switch repeat- 
ability error is "ratioed out" and does not affect measure- 
ment accuracy. Current designs use equal-resistance power 
splitters with a 50-ohm resistor on each leg to obtain the 
reference signal. The power splitters are integrated into the 
test port switch. The switch incorporates a shunt p-i-n diode 
at the splitter node that is forward-biased in the off state 
such that the node has a low impedance to ground and the 
off state main-arm impedance is 50 ohms. The reference 
receiver requires a much smaller signal than the test port, so 
additional external attenuators are usually added. The loss 
to the main arm for this configuration is nominally 6 dB. 
Thus, with no other changes, the maximum test port power 
and the system dynamic range would need to be degraded 
by 6 dB. To reduce the power loss in the four-receiver con- 
figuration, an unequal splitter was designed to tap off less 
power for the reference receiver. This decreases the loss 
through the main arm by about 2 dB, though it does make 
the power source match a bit worse. The ratio source match 
in the ratio mode depends upon the design of the splitter 
circuit (see page 72). 

In the three-receiver version of tlte network analyzer, the 
splitter for power level detection and the reference receiver 
shared the same split node and the same series 50-ohm re- 
sistor. In the four-receiver version, the power level detection 
arm is not part of the splitter, so the power loss from this 
path can be recovered, allowing more power to reach the 
test port. An alternative method for generating a detected 
signal was implemented that uses a custom HP GaAs IC. 

This chip has an unequal-resistor bridge with a built-in de- 
tector diode, which provides the necessary signal to the ALC 
circuit and has less than 2 dB of loss. The splitter-detector 
used in the three-receiver version had about 15 dB of loss, so 
an additional 4 dB of power is recovered by this modifica- 
tion. The detector circuitry is actually integrated into the 

(continued on cage Ml 
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Modeling Source Match Effects for Micro 

Source maich is a difficult measurement to make, as well as a difficult 
characteristic to simulate in a microwave source, because its value is 
evident only by observing the effect of varying a load and measuring the 
forward power available to the load. The degree to which the forward 
power changes with the load is an indication of the source match. 

For simulation purposes, a system was modeled with three different load 
termination impedances, and the power changes resulting from simulated 
load changes were manipulated to generate a ratio source match model. 
Fig 1 shows the system simulation, with the power splitter modeled as 
an ideal two-resistor block 

In the design work that followed, a much more complex model of the 
power splitter generated from transverse electromagnetic ITEM] mode 
microstrip and suspended stripline microwave components was used to 
take into account transmission line effects. The ability of this method to 
predict correctly the response of microstrip circuits is limited to the extent 
that the required circuit configurations do not match standard models or 
that non-TEM mode effects dominate the results. 

Very early in the design it became evident that the simpler microstrip 
models did not adequately account for non-TEM mode coupling between 
elements in the power splitter. While it was possible to add coupled line 
elements to try to account for these effects, no models available con- 
formed to the layout configuration required for the power splitter. Instead, 
a newer modeling technique just developed by the HPEEsof Division was 
used. This program, called HP Momentum, uses finite-element tech- 
niques to solve Gauss's law for fields at each portion {called the mesh) 
of a circuit. 

There were three constraints on the design of this microwave power 
splitter- the source match had to be better than 14 dB, the load match 
(with the common node of the splitter grounded) had to be better than 
14 dB, and the reference channel arm had to have a signal approximately 
20 dB down from the main arm signal. 

Fig. 2 shows the outline for the equal power splitter used in the 40-GH<! 
version of the HP 8720D family, and Fig. 3 shows the first pass of the 
unequal power splitter used in the 20-GHz version, as described in the 
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Fig. 2. Ourline of the equal power splitter 

accompanying article. The TEM simulator predicted good results, but the 
coupling between lines was found to be a dominant effect when an HP 
Momentum simulation was performed on the same circuit. Through some 
iterations between both kinds of simulations, it was determined that the 
large-value resistor for the unequal split needed to be divided between 
the input node and the reference output port. A large value is needed at 
the input node to isolate the mam arm from the reference arm, especially 
at high frequencies, but a large value is also needed at the output so that 
the voltage division that gives the reduced reference signal is applied to 
the expected signal path as well as any signal coupled onto the trans- 
mission line between the input and the output. Finally, that transmission 
line must have a width sufficient to allow placement of a dc blocking 
capacitor, which must be a high-performance single-layer capacitor to 
prevent unwanted resonances in the capacitor structure 




Fig. 1. System simulation with the power 
splitter modeled as an ideal two-resistor 
block. 



72 April l!)i)7 Hewlett-Packard Journal 

© Copr. 1949-1998 Hewlett-Packard Co. 




Fig. 3. First-pass design for the unequal cower splitter 

Fig. 4 shows the final circuit structure with shading from the HP Momen- 
tum simulator showing the current density It is possible to animate this 
display by changing the phase of the input signal and watching the re- 
sulting propagation through the circuit. It was clues from this type of dis- 
play that led to the solution of splitting the large-value resistor to reduce 
the effects of fields coupling onto the reference arm of the splitter 

For each change in the structure of the circuit, another finite-element 
analysis is performed, taking about one hour on an HP 9000 Model 
725/100 workstation with 256M bytes of RAM, to yield a three-port 
s-parameter data set for the new structure. This data set was further 
analyzed by simulation to determine the ratio source match. When the 
design was nearly final, artwork was submitted to the thin-film operation 
for prototyping. The first results were good, bul as with many microwave 
designs, testing indicated that some refinement would improve the per- 
formance, especially in further isolating the two sides of the splitter. The 
final design includes powdered ferrite (also known as polyiron) absorp- 
tive blocks between the arms and some small pads that can be added to 
compensate for mismatch caused by nonrepeatable solder or assembly 
operations Fig. 5 shows the final simulation of power source match and 
ratio source match for the final version of the circuit. 

Joel P Dunsmore 
Development Engineer 
Microwave Instrument Division 
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source amplifier microcircuit, so a new version of this 
module was introduced with the bridge detector circuit and 
Without the reference RF output. This change, along with 
the unequal power split in the swilch-splitter circuit, makes 
up for I he loss and means that no degradation is needed in 
the maximum output power Or dynamic range. 

The integrated bridge detector has a very flat frequency re- 
sponse, so the power out of the amplifier is quite flat . This 
would normally seem like a good thing, but there is a lot of 
(tower loss slope in the swilch-splitter and other components 
on the way to the test port. The detector sensitivity of the IC 
cannot be sloped, so the power flatness could become quite 
bad, something like S dB of variation from maximum to 
minimum power at the test port. Because of this, the design 
of the ALC circuit needed to be modified to accommodate 
this slope. 

The short duration of the project required that the ALC 
changes not result in any instrument firmware changes to 
improv e the power flatness. This requirement was met by 
sampling a portion of the tune voltage of the main oscillator, 
which is linear with frequency, and using it to modify the 
reference for the ALC level. Some diode shaping and adjust- 
able gain and offset allow a range of adjustments that pro- 
vide less than 1.5 dB of power variation over the 20-GlIz 
span of the network analyzer. 

For the 40-GHz version of the network analyzer, the equal 
splitter was retained because the greater loss at 40 < illz re- 
quires a larger signal to the reference receiver. The 40-GHz 
source amplifier was modified with the bridge detector cir- 
cuit to increase the available power by 4 dB. and the final 
power amplifier chip, again a custom HP GaAs IC, was 
changed to a newer high-power version that gives 2 dBm 
more output power. Thus, the output power specification 
and dynamic range of the 40-GHz analyzer are not degraded 
for the four-receiver version. 

Thru-Reflect-Line Error Correction 

One of the classic difficulties in making s-parameter measure- 
ments on a device is that a user would like the measurement 
environment to be as nearly Identical as possible to the envi- 
ronment in which the device will ultimately be used. The 
difficulty is often compounded by a lack of calibration stan- 
dards for a particular environment, or by the limited ability 
of the user to create calibration standards that are predict- 
able and reliable for that environment. 

The Thru-Reflect-Line (TRL) technique 2 - 3 has gathered 
momentum over the past decade because it only requires a 
known characteristic impedance for a short length of trans- 
mission line (the "Line" part of TRL). Furthermore, since an 
infinitely long transmission line can be represented by ter- 
minal ions or loads of a particular characteristic impedance, 
the technique can be extended to an entire family of TRL 
techniques, including Thru-Reflecl-Match, Thru-Reflect- 
Attenuator, Line-Rellect-Line. and so on. Like the IIP 8510 
network analyzer, the HP 8720D Option 400 instruments 
incorporate four samplers and a switch-splitter arrangement 
so that self-calibration using a TRL algorithm can be per- 
formed for high-accuracy s-parameter measurements.'' 

A key to the self-calibration technique is (he source and load 
match error removal, also called removal of switching errors. 
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Fig. !). Forward and reverse paths and signals measured in the TRL 
calibration technique. 

This is accomplished in both the HP 8510 and the HP 8720D 
Option 400 using the algorithm described by Rytling" 1 (see 
Fig. 'J). 

Rytting's paper gives equations relating the measured values 
of the device's s-parameters to the actual values for the gen- 
eralized condition in which the source and load impedances 
are not equal to an ideal 50-ohm impedance. Rewriting these 
equations using notation unique to the IIP 8720D family- 
gives: 



-siin 1 = [(a/r 1 ,-(a7r 2 ')(r 2 /r 1 )]/d 
S2im = [fb/r 1 )-(b7r 2 ')(r 2 /r,)]/d 
si2.„ = ((aVr/)-(a/r|)(r 1 7r 2 ')]/d 
s 22 ,„ = - (b/ri)(r,7r 2 ')]/d 

where d = 1 - ( T 2 /r] )(ri7r 2 0 



(1) 

m 

(3) 
(4) 
(5) 



and the following are complex numbers: s^iuii > s the mea- 
sured value of s-parameter Sfeni a is the A samplers measure- 
ment, b is the B sampler's measurement, r| is the R| sam- 
plers measurement, and r 2 is the R 2 samplers measurement. 

The prime sign (') refers to the reverse path as opposed to 
the forward path of the test set. Ideally, when r 2 = 0 and 
rj ' = 0. the switch path has perfect Z,, impedance, d = 1 . and 
the s-parameters reduce to the single-term ratios. 

In the HP S720D family, the intermediate frequency (IF) paths 
for measuring the reference signals r\ and r 2 are implemented 
by an IF multiplexing board described later. Because of the 
architecture of this board, the magnitudes and phase shifts 
through the various paths are not perfectly balanced. What 
is important, however, is that all of the ratios are made rela- 
tive to the same reference IF path, 

In the HP 8720D, the ratio a/rj is measured using the A IF 
path and the R IF path. b7r 2 ' by the B and R IF paths, and so 
on. The ratios r 2 /ri and fj'ftg' are measured using only the A 
and B IF paths. 

To establish some notation for this, we will use (° and capital 
letters to refer to the IF paths used. For example. a<? A is the 
measurement of a using the A IF path. Thus, the five equa- 
tions above become: 

siim= [(a@A/ri@R)-(a'@A/r 2 '@R)(r 2 @B/ri<s-A)|/d (6) 
S2i m = [(b@B/rr@R) - (b'@B/r 2 '@R)(r 2 @B/ri@A)]/d (7) 
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si2m= [(a'@.Vr 2 '@R) - (a@ Vr,@R)(ri'@A/r 2 '@B)l/d (8) 

S22m = [(b'(gB/r 2 '@R) - (b<g&TiQK)(T l '@AJr 2 '@B)\rd (9) 

d = 1 - (r 2 @B/r I @A)(r 1 '@A/r 2 '@B)- (10) 

If you look at these equations carefully, you will note that 
the measurements of the reference ratios r 2 /ri and ri'/r 2 ' are 
not made relative to the R IF path, but the other measure- 
ments are. Some means for correcting for this difference is 
needed. The HP 8720D allows measurement of r 2 using both 
the R and B paths, and measurement of ri by both R and A 
(see Fig. 10). The correction for IF path differences can be 
reconciled by: 

(r 2 (sB/ri@A)(r 2 @R/r 2 @B)(ri@A/ri@R) = r 2 @R/ri@R (11) 

(r 1 '@A/r 2 '@B)(r 1 '@R/ri'@A)(r 2 '@B/r 2 '@R) = ri'@R/r 2 '@R (12) 

Since r 2 is the same signal to both the R and B paths, the 
measurement ratios for forward and reverse are equal: 



r 2 @B/r 2 @R = r 2 '@B/r 2 '@R 



(13) 



This ratio can be symbolized as r 2 refcomp for r 2 reference 
compensation. 



Likewise, for fj: 

ri@A/ri@R = rr@A/r,'@R 
This can be called rjrefcomp. 



(14) 



Rewriting equations 1 1 and 12, the reference ratios are now 
compensated for the IF path differences: 

(r 2 @B/ri@A)(rirefcomp)/(r 2 refcomp) = r 2 @R/ri@R (15) 

(r 1 '@A/r 2 '@B)(r 2 refcomp)/(rirefcomp) = r.'@R/r 2 '@R (16) 

The original equations 1 through 5 can now be used, as long 
as the reference ratios are compensated as shown in equa- 
tions 15 and 16. 

10-MHz IF Multiplexing Board 

The HP 8720 family of network analyzers was originally de- 
signed with three receivers based on the HP 8753 IF process- 
ing hardware. The first converter, however, was leveraged 
from the HP 8510 network analyzer, which has four receivers. 
Thus, there was a place in the first converter of the HP 8720 
where a fourth receiver could be added easily, but there was 
no fourth channel in the IF processing hardware to connect 
to its output. Therefore, a four-channel-to-three-channel 
multiplexing circuit was added to the HP 8720D, between 
the four first-converter outputs and the three second con- 
verters (see Fig. 10). This circuit switches the 10-MHz IF 
signals so that all four channels can be measured and TRL 
calibration and error correction can be performed properly 
in the Option 400 models. In a standard three-channel instru- 
ment, this circuit does nothing (the switches are always left 
in the same positions). 

At first glance, it would seem that only the R) and R 2 chan- 
nels need to be switched, and the A and B channels could go 
straight through. This would allow measurement of all four 
channels; R), A, and B in the forward sweep, and R 2 , A, and 
B in the reverse sweep. Unfortunately, this would not satisfy 
the requirements of TRL calibration. During the calibration 
process, it is required to measure the vector ratios R\/R> and 
R 2 /Ri, so the additional switching of Fig. 10 is needed. 



Design considerations for the IF multiplexer board were: 

• Zero-dB insertion gain 

• Harmonic distortion below - 50 dBc at - 10 dBm (full scale) 
for good measurement linearity 

• Noise low enough to have little effect on the overall instru- 
ment noise floor 

• IF cross talk between channels less than - 100 dB. 

The circuit design uses a commercial low-noise IC video 
operational amplifier with a disable input pin. When dis- 
abled, the amplifier's output is a high impedance, so two of 
these ICs can be used as a SPDT switch by simply connect- 
ing their outputs together (Fig. 11). This design was pre- 
ferred over a video switch because it made it easier to 
achieve high isolation and it has simple, high-impedance 
input, low-impedance output interfaces. Each amplifier has 
off-isolation of 60 to 70 dB. so two are cascaded together in 
each signal path to give greater than 120 dB. 

The output noise power of the IF switch circuit is — 123 dBm 
in a 3-kHz bandwidth, which is 113 dB below full scale. The 
overall instrument has typically 100 dB of dynamic range in 
a 3-kHz bandwidth, so the IF switch contribution is negligible. 
Second and third harmonics are typically — 56 dBc at a 
- 10-dBm output level. Careful consideration of gains and 
attenuation in the switch circuit was needed to arrive at a 
good compromise between low noise and low distortion. 

The most difficult design goal to meet was the switch off- 
isolation, which directly affects the instrument's cross-talk 
specification. Several iterations of the circuit design and 
board layout were required before the goal was finally met. 
Careful layout, use of multiple ground planes, and bypassing 
the power and switch control signals all played a part. But 
the most vexing problem by far was how the sampler output 
signals were brought onto the board. These four signals 
originally entered the board from the instrument's mother- 
board through two pin-and-socket connectors, but the best 
performance that could be achieved with this arrangement 
was only 510 dB of isolation. It was extremely difficult to 
attenuate the radiation from the connector pins sufficiently. 
Eventually, the design team chose to bring the signals onto 
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the board in flexible coax cables, which was deemed a better 
solution than Crying to shield sections of the connectors 
wilh elaborate sheet-metal pails. The final design achieves 
less than - 100-dB cross talk between all channels, with 
typically - 120 dB in the most important cross talk paths. 

Source Frequency Control Techniques and 
Improvements 

The main components of I lie HP 87201) network analyzer 
source are a microwave YIG oscillator, a sampler, a frac- 
tional-N synthesizer, a pulse generator, a main phase-locked 
loop circuit, and a YIG driver, as shown in Fig. 12. Fig. 13 
shows the main phase-locked loop circuit. 

The source uses a master-slave technique to control the YIG 
oscillator frequency based on harmonic multiples of the 
fractional-N synthesizer output. To set the output frequency, 
the YIG oscillator is first pretuned to the start frequency by 
selling a digital-lo-analog converter (DAC) to a value that 
was previously deriv ed during calibration. The output of the 
DAC is applied to the YIG oscillator driver circuit, establish- 
ing the YIG frequency setting. This simple tuning technique 
is only sufficient for a rough tuning of the oscillator. The 
output is neither precise enough nor stable enough for net- 
work analysis. The fractional-N synthesizer is tuned to a 
frequency that is determined by the formula: 



"film — 



fgc + fjf 
N 



where fr,.„. n is the fractional-N synthesizer frequency, f srt . is 
the YIG oscillator frequency, fjf is the intermediate frequency 
( in MHz), and N is the harmonic number. 

For example, for a source frequency of 10 GHz, ihe frac- 
tional-N frequency would be: 



fnicn 



10'" + uf 

58 



= 172.580207 MHz. 



Once the YIG oscillator is pretuned and the fractional-N 
synthesizer frequency is set, the main phase-locked loop is 
closed and lock is acquired. The fractional-N frequency is 



then swept. Since the YIG oscillator is phase-locked to the 
fractional-N synthesizer, it also sweeps at a rate of N times 
the fractional-N sweep rate. 

One of the limitations of the master-slave tuning technique is 
the YIG oscillator band crossings. The YIG oscillator can be 
tuned over a wide frequency range that includes several 
harmonic bands. The fractional-N synthesizer has a typical 
range of 120 to 240 MHz. To sweep the YIG oscillator over 
its full range requires several band crossings, with relocking 
of the main phase-locked loop at each band crossing. To 
avoid pretuning at each band crossing, a sample-and-hold 
circuit is incorporated as part of the main phase-locked 
loop. In operation, the fractional-N synthesizer is swept over 
its range with die YIG oscillator following. Once the frac- 
tional-N synthesizer has reached its maximum frequency it 
must be reset and the YIG oscillator relocked to the next 
harmonic comb looth. The YIG oscillator tuning voltage is 
held constant by the sample-and-hold circuit while the frac- 
tional-N synthesizer resets to the start of its range. After 
resetting, the loop is closed and lock is reestablished at the 
same frequency, but locked to a new harmonic comb tooth." 

The YIG oscillator frequency is determined by the amount of 
current flowing in its main timing coil. The transfer (unction 
relating main coil current to output frequency is somewhat 
nonlinear. This makes pretuning the YIG oscillator difficult. 
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since a pretune DAC value must be derived for each pretune 
frequency. Pre\ious techniques used a multispline, piece- 
wise linear curve Fitted to the tuning curve of the oscillator. 
However, this complicated the pretune calibration process 
and there was the potential for errors resulting from inaccu- 
rate c urve fitting. 

Another difficulty is YIG oscillator hysteresis. A given amount 
of current applied to the main timing coil will result in a dif- 
ferent frequency output depending on the previous frequency. 
For example, if the oscillator had previously been set to 
20 GHz. and then 1 mA were applied to the main coil, the 
resulting output frequency would be several MHz different 
from the frequency that would result if the previous frequen- 
cy had been 3 GHz. This hysteresis effect results in inaccu- 
racy when pretuning and potential false locking to the wrong 
harmonic comb tooth (N). 

Improvements in Frequency Tuning and Control 

Hysteresis effects of ferromagnetic material are well-under- 
stood and were used to advantage in early computers* core 
memory banks. Hysteresis effects can be minimized by set- 
ting the main tuning coil of the YIG oscillator to zero before 
pretuning. This in effect erases the oscillator's memory of 
the previous output frequency. 

To avoid the nonlinearity of the YIG oscillator tuning curve 
the technique of pretuning dose to the actual start frequency 
was eliminated. The new technique now prettmes to the YIG 
oscillator's minimum start frequency regardless of the actual 
desired start frequency, and the YIG is then swept up to the 
desired Stan frequency. This simplifies the pretune calibra- 
tion process, since only one DAC number is necessary for 
pretuning. No curve fitting is required, since there is no 
longer a curve to fit. This pretuning works well with setting 
the main coil current to zero to minimize hysteresis because 
the YIG is then already at the minimum of its tuning range. 

This tuning technique solves the two difficult problems of 
inning curve nonlinearity and YIG tuning hysteresis. How- 
ever, repeated narrowband sweeps at the upper end of the 
YIG oscillator's tuning range result in long cycle times, since 
the YIG must sweep from its minimum start frequency up to 
its upper frequency limit on each sweep. Setting the main 
tuning coil current to zero to minimize hysteresis also adds 
to cycle time, since the time constant of the large-inductance 



Fig. 13. HP 87201) main phase- 
locked loop block diagram. 

main tuning coil must be taken into consideration. The solu- 
tion to correct for the long cycle time was to allow for 
sweeping the YIG oscillator backwards as well as forwards. 

To illustrate the technique of sweeping backwards versus 
pretuning. take the example of a sweep from 19 to 20 GHz. 
Using the pretuning technique for all sweeps would result in 
crossing through four bands, with the subsequent delays 
resulting from four fractional-N frequency resets as well as 
the long delay required for resetting the main coil current to 
minimize hysteresis effects. Sweeping backward from 20 to 
19 GHz requires no pretuning and no band crossings, and 
therefore results in a much faster cycle time for network 
analyzer measurements. 
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Optimization of Interconnect 
Geometry for High-Performance 
Microprocessors 

The goals of the work presented in this paper were to estimate 
quantitatively the impact of interconnect technology parameters on the 
performance of high-end microprocessors and to use this information to 
optimize the interconnect geometry within the constraints imposed by the 
process. The 64-bit PA 8000 microprocessor was used as a test case. 

by Khalid Rahmat and Soo-Young Oh 



For I he past two decades the driving force for integrated 
circuits has been scaling of both the devices and the inter- 
connect. This has yielded faster and denser chips with ever 
increasing functionality. Today's high-performance micro- 
processors have 4 to 5 million logic- transistors' and oper- 
ate in the frequency range of 200 to 250 MHz. To stay on the 
trend for CPU performance gains, the next -general ion pro- 
cessors will likely have clock speeds of 400 to 500 MHz and 
have up to 10 million transistors for the logic- implementa- 
tion. 1 The design and manufacture of systems that provide 
such performance will require careful attention to all pro- 
cess, device, and circuit issues. 

In particular, the interconnect system for the next-generation 
CPUs needs to sustain die burden of connecting 10 million 
transistors with 5 to G metal layers over a 2 -cm die with a 
clock cycle of 2 to 2.5 ns. Because the interconnect system 
affects the manufacturing yield, reliability, performance, and 
design complexity of a CPU, it cannot be optimized in isola- 
tion but rather must take into account all these aspects. 

The goal of the work presented in this paper was twofold. 
First, we wanted a quanl itative estimate of I he impact of 
interconnect technology parameters on the performance of 
high-end microprocessors. Second, we wanted to use this 
information to optimize the interconnect geometry within 
the constraints imposed by the process. It is well-known that 
interconnect Will become an increasingly significant factor 
in the design of future high-performance processors.' 1 - 3 
Therefore, it is important to hencltmark the current technol- 
ogy with realistic process parameters and circuit designs. 
The detailed layout and extracted data for the HP PA 8000 
microprocessor 1 provided an ideal testbed to perform both 
of these tasks. 

Two criteria were used for optimization: delay or rise time 
(they have very similar characteristics) and cross talk 
noise. The optimization was performed only for the global 
( block-to-block) routing consisting of metal layers 2, 3, and 4. 
Because the longest signal wires are usually at this level of 
routing, optimizing these layers has the most direct impact 



on system performance. For optimization, we only consid- 
ered nets thai had marginal liming based on the expected 
performance target for the processor. 

Optimization Methodology 

Our optimization approach is as follows: 

L Determine the liming-crilical nels using a global static 
timing simulator. 

2. From litis set of nets select a sample that has significant 
global interconnect. 

3. Extiact the driver sizes and wire lengths for this inter- 
connect-dominated subset of the global nets. 

4. For these global interconnect dominated nets, determine 
the sensitivity to all interconnect parameters: metal thick- 
ness, width, and space and dielectric thickness. 

5. For two of the most sensitive parameters perform two- 
dimensional optimization to determine the best design point 
for each metal layer considered (metal 2, 3, and 4). 

The total wire capacitance and cross talk are calculated 
assuming a dense geometry with neighboring lines within 
the plane of the signal line and ground planes above and 
below, as shown in Fig. !, The wire capacitance is obtained 
using the field solver RAPHAEL in 2D, and the cross talk 
and delay are obtained from SPICE simulations using the 
coupling and total capacitances generated from RAPHAEL. 
The cross-talk noise and delay were simulated in worst-case 
situations. For delay litis means that the neighboring lines 
were switching simultaneously in the direction opposite to 
the signal line, and Tor cross talk the victim line was quies- 
cent while the neighboring lines switched simultaneously in 
the same direction. 

Results 

Fig. 2 shows the distribution of metal lengths among the 
nets that failed the timing criterion at the target frequency. 
A significant number of critical nels have long metal 2 and 
metal 1 lines while metal 3 is usually much shorter. This is. 
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Fig. 1. Schematic of the interconnect parameters considered for 
optimization. 

of course, a direct consequence of the orthogonal channel 
routing scheme used by the HP-proprietary automatic router, 
PA-ROUTE. The other salient feature of Fig. 2 is the large 
number of nets thai have 5 to 6 mm of metal 2. Similarly, 
there is a secondary peak of 4 to 5 mm for metal 4. Thus, an 
obvious target for optimization is wires that have this typical 
long length. We did not select the longest wires for optimiza- 
tion because they were not typical. 

The drivers for all of the nets studied were approximately 
44 um wide (W„ = width of n-channel transistor = 44 um; 
W p = width of p-channel transistor = 2Wn). The drivers 
were large enough that the delays and rise times were deter- 
mined primarily by the interconnect and not the driver size. 

Metal 2 Optimization. Using the strategy oullined above, we 
considered nel 1, which has a length of 6.8 mm in metal 2, 
3.6 mm in metal 3 and 0.9 mm in metal 4. Fig. 3 shows the 
impact on cross-talk noise and rise time (at the load) of 
varying only one interconnect parameter while keeping all 
others constant. The nominal values for the metal 2 parame- 
ters are: 

Metal width = 1.2 um 
Metal thickness = 0.7 um 
Metal space = 1.2 um 
Dielectric thickness = 0.95 um. 

It is clear from Fig. 3 that the two metal 2 parameters to 
which die rise time is most sensitive are the metal width and 
thickness. Thus, the next step in our optimization strategy 
varies these two parameters simultaneously to generate a 



two-dimensional contour plot of rise time and cross-talk 
curves, as shown in Fig. 4. When we vary the metal 2 width 
we keep the pitch constant, which is a more realistic situa- 
tion than allowing the pitch to vary. too. To reduce the rise 
rime, either the width or the thickness (or both) of metal 2 
lines must be increased. For example, going from width 
= 1.2 um. space = 1.2 um. and thickness = 0.7 um to 
width =1.4 um. space = 1.0 um. and thickness = 0.8 um 
will yield a reduction in rise time of almost 150 ps (14%). 
From Fig. 4. it can be estimated that this change will pro- 
duce an increase in worst-case cross-talk noise of less than 
100 mV to about 0.52V which may still be acceptable. 

Metal 3 Optimization. We proceed here just as in the case of 
metal 2 by first choosing a net that has significant length in 
metal 3. In this case this is net2 w r hich has 0.7 mm of metal 
2, 3.3 mm of metal 3, and 2.3 mm of metal 4. The nominal 
metal 3 parameters are: 

> Metal width = 1.2 um 

1 Metal thickness = 1.1 um 

1 Metal space = 1.2 um 

i Dielectric thickness = 0.95 um. 

Because of the relatively short length of the metal 3 lines, 
the rise time is not very' sensitive to metal 3 parameters, as 
shown in Fig. 5. Of the four parameters, the sensitivity is 
greatest to the metal 3 spacing and the thickness of the di- 
electric. As before, we choose these two parameters (while 
keeping the metal 3 pitch constant) to perform a t wo dimen- 
sional optimization as shown in Fig. 6. The rise time can be 
improved by using a slightly narrower width (width = 
1.0 um, space = 1.4 um), but the absolute improvement is 
small (20 ps). 

Metal 4 Optimization. For metal 4 we used net3, which has 

1.3 mm of metal 3 and 6.2 mm of metal 4. The nominal 

metal 4 dimensions are: 

Metal width = 1.6 um 

Metal thickness = 1.2 um 

Metal space = 2.4 um 

Dielectric thickness = 1.05 um. 

For metal 4, the rise time is most sensitive to the metal spac- 
ing and the dielectric Utickness as seen in Fig. 7. However, 
for the nonunal metal 4 spacing of 2.4 um, the rise time is a 
fairly smooth function of spacing, so the spacing between 
metal 4 lines could be reduced (while keeping the width 
constant ) by up to 0.4 um without increasing the delay signifi- 
cantly. Thus, the pitch could be reduced without adversely 
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imparting the delay. If the goal is to redure delays, then two- 
dimensional optimization reveals tltal the rise time ran only 
be improved by using a narrower metal 4 and a thirker di- 
elertrir as shown in Fig. 8. Cross talk is a much smaller con- 
cern here because of the wide spacing between metal lines. 

Conclusion 

The main results of this study are as follows 

• Global routing in metal 2 wires causes significant delays 
because of the high resistance associated with this layer. 
The delays can be reduced by increasing the metal thick- 
ness and by increasing the metal 2 width even if the metal 2 
pitch is kept constant. 

• Typical lengths of global interconnect in metal 3 are rela- 
tively short. Therefore, delays incurred because of routing 
in metal 3 are minimal. Hence, the optimization for metal -'3 
should be primarily based on performance within functional 
units. 

• For metal 4. the delays are primarily affected by the capaci- 
tance of the lines rather than the resistance, since the thick- 
ness is substantially greater than for metal 2. Therefore, 
shorter delays require thicker dielectrics and narrower metal 
lines when constant pitch is assumed. 

In our study the gains in performance of up lo 200 ps can be 
obtained by wire optimization within reasonable process 
parameters. A simple example will illustrate the impact of 
an improvement of this magnitude on the cycle time of a 
microprocessor. Consider a hypothetical processor fabri- 
cated in a manufacturing process that yields a nominal cycle 
lime of 4.0 ns (250 MHz), with a standard deviation, assuming 
a normal distribution around the mean of 0.3 ns. hi this case, 
only 7.0% of the devices will have a speed of 280 MHz or 



greater. If the nominal design is changed to reduce the cycle 
time by just 200 ps, then the fraction of the devices yielding 
above 280 MHz will increase threefold to 22.2%. Thus, a 5.0% 
reduction in the nominal cycle time produces a 200% increase 
in the number of the fastest devices, which command a pre- 
mium because of their high performance. 
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Designing, Simulating, and Testing an 
Analog Phase-Locked Loop in a Digital 
Environment 



In designing a phase-locked loop for use on several HP ASICs, the digital 
portion of an existing phase-locked loop was transferred to a behavioral 
VHDL description and synthesized. A behavioral model was written for the 
analog section to allow the ASIC designers to run system simulations. A 
new leakage test was developed that has been very effective in screening 
out process defects in the filter of the original design. 

by Thomas J. Thatcher, Michael M. Oshima, and Cindy Botelho 



This paper describes the design and integration process for 
a phase-lucked loop lhal is being used on several eurreni IIP 
ASICs (appliration-speeific integrated circuits). The design 
is based on die phase-locked loop for a previous ASIC, hut 
several improvements were made. First. I he digital portion 
of the phase-locked loop was transferred to a behavioral 
VHDL* description and synthesized. Reusability was a big 
consideration in writing the code. The portable nature of the 
\ I II )I, code enabled us to design sev eral phase-locked loops 
within a very short time. Second, a behavioral model was 
written for the analog section to allow the ASIC designers 
to run system simulations. This model, when combined with 
the model for the digital section, allows the designer to sim- 
ulate the phase-locked loop as it locks — it does not merely 
put out an ideal clock waveform. Finally, in a previous ASIC, 
a large resistor and capacitor in the loop filter were not ade- 
quately tested. For the new phase-locked loop, we developed 
a leakage lest that has been very effective in screening out 
process defects in the filter. 

An analog phase-locked loop presents several challenges to 
designers in an all-digital design environment. Some all-digital 
simulators, such as Yerilog XL. cannot represent analog sig- 
nals easily. System designers must either use a mixed-mode 
simulator to represent the analog portions of the phase- 
locked loop, or use a simplified model of the phase-locked 
loop. In ASIC production test, limitations of the production 
test equipment must be taken into account. For example, 
an analog measurement may take a long time to complete. 
Also, functional testers cannot measure frequency, so it is 
difficult to determine that the phase-locked loop is operating 
properly in production test. 

Previous Phase-Locked Loop Design 

The prev ious phase-locked loop design appeared on an 
ASIC, where its purpose was to accept an input clock (the 
video clock) and generate the system clock. The clock sig- 
nal output from the phase-locked loop was modulated so 
that the output clock frequency alternated back and forth 

' VHDI. stands for Very High-Speed Integrated Circuit Hardware Description language 



between two frequencies slightly above and below the target 
system clock frequency. This frequency modulation of the 
system clock was required by the system in which the pre- 
vious design was used. 

The block diagram of the previous phase-locked loop is 
shown in Fig. 1. The loop consists of an input counter 
(divide-by-M ). a feedback counter (divide-by-N), a phase- 
frequency detector, a charge pump and filter, a voltage- 
controlled oscillator (VCO), and other digital control logic. 
The input counter divides the input clock by either Mhiigi or 
M|o«- It produces an output pulse (FREF ) that is low for one 
clock cycle and high for the remaining time. The feedback 
counter divides the VCO output by Nygji or N| mv . It produces 
an output pulse (FBAK| that is low for two dock cycles. The 
phase- frequency detector examines the relative phase of the 
rising edges of the FREF and FBAK signals and generates pulses 
on the UP and DOWN signal lines. The charge pump uses 
these pulses to adjust 'be control voltage for the VCO. The 
output signal is generated by dividing the VCO output clock- 
by 4. The resulting phase-locked loop output frequency is 
given by: 



The control block consists of logic that selects one of the 
operating frequencies. A state machine in this block controls 
the modulation between the upper and lower frequencies. 
In normal mode, the control logic sets the phase-locked loop 
to the tipper frequency when it is reset. The loop remains at 
the tipper frequency until the control block receives a signal 
that indicates that the loop is locked. After this, the loop 
alternates between the upper and lower frequencies. This is 
controlled by the N_SEL and M_SEL outputs of the modulator 
counter. 

There were two main problems with the integration of the 
phase-locked loop. The first problem was that there was 
no model of the phase-locked loop in YIIDL or Yerilog that 
could be used for system simulation. Therefore, no simula- 
tions were run with the clock generated by the phase-locked 
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Fig. L Block diagram of the original phase-locked loop design. 

loop. All simulations were run with an external clock, using 
a different chip mode. This caused the design team to miss 
a serious bug in the design. When the mode pins were set to 
enable the phase-locked loop, one block inside the chip was 
accidently set into scan mode. This problem was not caught 
until the first prototype parts were placed on a board. 

The second problem encountered with this phase-locked 
loop design was a high production line failure rate. The 
phase-locked loop tests were not catching all the defective 
parts. Analysis of the returned parts showed that the failures 
were caused by defects in the resistor and capacitor in the 
loop filter, which caused excessive leakage, changing the 
filler characteristics. The production tester had no way to 
test for litis, so a new test had to be created. Here the lack 
of a good simulation model for the phase-locked loop was a 
real handicap. The original tests for the phase-locked loop 
were debugged on the tester. When trying out new tests, we 
had no way of simulating them to verify their correctness. 
Thus, it took two or three iterations before the tests were 
correct. 

New Design Approach 

Other ASICs being designed by our design center required 
phase-locked loops with similar characteristics. These 
ASIC's were designed in a different manufacturing process. 
Therefore, the phase-locked loop had to be redesigned so 
that it would be leveragable to many different chips. As a 
result, it was decided to break up the design into two pans 
(Fig. 2). The digital logic would be designed using a VHDL 
synthesis strategy. Behavioral VHDL code would be written 
and then synthesized into our standard cell library. The code 
would be written so that it was easy to customize for new 
projects. Once the synthesis was complete, the netlist for 
the digital block would be routed and then integrated into 
the top level of the phase-locked loop. The analog portions 
of the phase-locked loop would be designed using a full- 
custom methodology. We would leverage the analog blocks 
from the previous phase-locked loop, but they might have to 
be modified and resimulated. Every new phase-locked loop 
design would undergo a final stability analysis. 



Model Structure 

In the cuzrenl design flow, the customer designs the chip by 
writing behavioral code in VHDL, synthesizing the design, 
and running final simulations in Verilog. To have a phase- 
locked loop model available for system simulations, we 
needed both a VHDL model and a Verilog model. The struc- 
ture of the model is the same for both VHDL and Verilog. 
In the top level of the model are a digital part and an analog 
part. The model used for the digital part is simply the gate- 
level netlist for the digital block. Delays were calculated 
from the routing capacitance and were back-annotated 
using Standard Delay Format (SDF). The model for the ana- 
log part was written separately in VHDL and Verilog, These 
models were written at the behavioral level. 

The logic for the digital section of the new phase-locked 
loop was highly leveraged from the old design. In most 
cases, the original phase-locked loop schematics were used 
as a guide, and the VHDL models for all the blocks inside the 
digital section were written so that the synthesis tool would 
produce equivalent logic. Constants were used in the code 
so that the design could be easily adapted. This made ii much 
easier to change the design when specifications changed. 

The analog section of the phase-locked loop was modeled in 
two parts. The first part modeled was the phase-frequency 
detector. The phase-frequency detector puts out a pulse on 
either its UP or DOWN output whose width equals the distance 
between the rising edges of the divider outputs FREF and FBAK. 
The state table for the phase-frequency detector is shown 
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Fig. 2. Structure of the phase-locked loop model. 
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Fig. 3. State table for the phase-frequency detector. 

in Fig. 3. This was a fairly straightforward model to write, 
since the inputs and outputs were digital signals. 

Model for the Charge Pump and VCO 

Finally, a model needed to be written for the charge pump, 
filter, and VCO. A simplified model of the charge pump and 
filter is shown in Fig. 4. A positive pulse on the UP signal will 
cause the voltage on VCNTL to rise, and a positive pulse on 
the DOWN signal will cause VCNTL to fall. The filter is added 
to ensure the stability of the loop. 

A model was needed that would not slow down the simula- 
tion too much, and that could be written easily in both Veri- 
log and VHDL. One problem with Verilog is that there is no 
easy way to define analog signals (although they could be 
represented with arrays or integers). 

This problem was solved by using a variable of type time to 
represent the period of the VCO output clock. As pulses on 
the UP and DOWN signals arrive at the charge pump model, 
the length of the period is adjusted accordingly. The model 
calculates the width of the pulse on the UP or DOWN signal 
and divides by the maximum period count to determine the 
amount of correction to the period. 




Fig. 4. Simplified charge pump and filter model. 
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Fig. 5. Operation of the cliarge pump. 

The model operates much differently than the analog circuit. 
The operation of the analog circuit is shown in Fig. 5. When 
the UP signal is high, the charge pump charges C| and I he 
voltage VCNTL rises. When UP is low, charge sliifts from C'i to 
C - 2 through the resistor and VCNTL falls. 

In the simulation model of the charge pump and VCO, the 
model calculates the width of the pulse on the falling edge 
of the UP signal. It then applies the correction to the period 
of the VCO. Some time later it will reduce the correction by 
half. This roughly approximates the rise and fall of the VCNTL 
voltage in the real circuit. The operation of the model is 
shown in Fig. 6. 

In practice, this model works v ery well. The model locks to 
the proper frequency, and if modulation is used, the phase- 
locked loop will modulate between the proper two frequen- 
cies. However, the transition from one frequency to another 
will be much different from the actual phase-locked loop. 
Typically, the model locks much faster than the actual circuit. 



n 



36 ns ■ 
Period 33 ns 

30 ns 

Fit;. 6. Operation of the charge pump model. 
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Process for Developing New Phase-Locked Loop 
Designs 

The inteM from the beginning was to create a design that 
was easy to modify and adapt. Each ASIC has its own re- 
quirements for the input and output frequencies. Here is the 
procedure for developing a new customized version of the 
phase-locked loop. 

1. Copy the V1IDL code for the digital block and make- 
changes to customize the code. I'sually this involves chang- 
ing constants in the code for the M counts, the N counts, anil 
the modulation counts. 

2. Simulate the entire phase-locked loop in VHDL to make 
sure that the operation of the digital block is correct. 

■}. Synthesize the digital logic using Synopsys. 

4. Have the resulting netJist placed and routed in a block. 
We use Cadences Cell3 to do placement and routing. 

5. Calculate circuit delays from the routing capacitance. 
Calculate RC delays. 

6. Do a timing analysis for the digital logic using the calcu- 
lated delays. 

7. Do a logic simulation using the real delays. 

8. Make modifications to the phase-locked loop production 
tests. 

9. If necessary, adjust the artwork for the analog blocks 
according to the specifications for the new phase-locked 
loop (for example, to adjust the frequency range of the 
VCO). 

10. Redo the stability analysis lor the phase-locked loop. 

11. Place the routed digital block into the artwork for the 
top-level phase-locked loop. 

Modifications to the digital section of the phase-locked loop 
can be done in about a week. This was very helpful for some 
ASIC designs because the system requirements for the phase- 
locked loop were changed Several limes. Changes to the 
analog section usually lake much longer. 

Problems with the Model 

The availability of the complete model for the phase-locked 
loop was a great advantage. It enabled us to debug produc- 
tion tests for the loop before the chip was released, hi addi- 
tion, it allowed the customer to do system-level simulations 
that included the full modulating capability of the phase- 
locked loop. Our customers have used this capability to 
catch problems in the clocking and W debug corner-case 
problems. However, there were some minor problems with 
the model. 

The first problem is that the model of the charge pump and 
VC( ) is very sensitive to the simulation resolution. The model 
was developed and runs fairly well with a resolution of 
111,0 ps. Il does not work very well when simulations are ran 
with 1-ns resolution. This is because the model needs the 
extra resolution when adjusting the V< l( I clock period. 

The second problem is that it is difficult to initialize the 
model, Because there is a feedback loop in the model, un- 
known values present at the beginning of the simulation 



propagate around the loop. To get the phase-locked loop to 
initialize, it is necessary to put the loop into its digital test 
mode or to force the counter reset signal in the phase- 
locked loop to 0 and then release it. Most people prefer the 
latter option because it requires less effort and doesn't re- 
quire changing the system tests. 

Once the model is initialized and running, it is still suscepti- 
ble to unknown values being introduced. Since the phase- 
locked loop contains two asynchronous clock domains (the 
input clock and the VCO output clock), occasionally there 
will be a setup or hold violation on a synchronizer flip-flop. 
As a result, the output of that flip-flop becomes unknown, 
and this unknown value propagates around the loop. This 
problem could be averted if there were a synchronizer flip- 
flop model that did not go to an unknown state on a setup 
or hold violation. 

Testing and Test Development 

Our experience with phase-locked loop designs has showed 
us the need for more complete testing of the loop. Originally, 
three main tests were created for the production test of the 
loop: 

• An analog test ran the phase-locked loop in all functional 
modes, allowed the loop to lock, and then made match 
assertions for 0 and L This test only tested that the phase- 
locked loop was producing a clock waveform. It did not lest 
t he frequency or duty cycle of the loop output. 

• A digital lest tested the functionality of the digital block 
through four ad-hoc test points inserted in strategic loca- 
tions. 

• A VCO linearity test applied three different clock frequencies 
to the clock inpul of the phase-locked loop. The loop was 
allowed to lock, and then the control voltage to the VCO was 
measured with the precision measuring unit of the tester. 
The lest program then checked thai the control voltage was 
higher for higher frequencies. This test was dropped from Ihe 
production lest because of concerns that the large capaci- 
lancc of Ihe precision measuring unit was altering the voltage 
measurement when it was connected. 

These three lesls did not fully cover the phase-locked loop. 
The digital logic was covered pretty well, and the basic func- 
tionality of the analog part was tested, but the production 
test did not catch any parameter variations that would cause 
the phase-locked loop to fail to lock. To address Ibis concent, 
four additional lesls were developed for later phase-locked 
loop designs, and three of these tests have been incorporated 
into the final production test. 

Lock Test. A test was needed Ihal would give some indication 
that the phase-locked loop was able to lock. To accomplish 
this, a new test was Created. This test checks an internal 
signal in the phase-locked loop that indicates that Ihe loop 
Is close to being locked. This signal is one of the four test 
points thai appear in Ihe phase-locked loop design. To ran 
this test, the loop is placed in one of its fixed frequency 
modes and allowed to lock. The internal lock signal is then 
checked to be sure ihal ihe loop has locked. 

Clock 1 • Test, i )ther ways (if testing that ihe phase-locked 
loop was able to lock were also investigated. One test in- 
volved selling the loop into a frequency mode in which its 
output frequency was equal lo Ihe inpul frequency (or some 
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even multiple). Then the loop was allowed to lock. After the 
loop had locked. Ihe output divider was reset to make the 
phase predict al ilc. and the clock output was Strobed during 
the clock low time for 1(10 cycles. Then ihe output was 
strobed during the clock high time for 100 cycles. The idea 
was that if l Ik* phase-locked loop was not at the correct fre- 
quency, the phase would drill and I here would be si robe 
failures. This lest did not work very well in practice. When 
Ihi' loop was put inlo a chip and simulated, the difference 
between fast and slow delays made it impossible to find a 
good time to strobe Ihe output clock. As a result, this test 
had to be dropped from the production test 

Leakage Test. In one ASIC, the phase-locked loop circuit was 
suspected of causing an excessive production line failure 
rate. After some investigation. Ihe filler of the phase-locked 
loop was identified as ihe cause of these failures. Manufac- 
turing defects in Ihe large resistors and capacitors caused 
excessive leakage, changing Ihe characteristics of the filler 
and preventing proper operation of the phase-locked loop. 
To screen out these defects, a new test was developed to 
Check the filler for excessive leakage. In this lest, the filter's 
analog signal is multiplexed oul to a dedicated test pin and 
the charge pump circuit is turned off by means of a control 
bit. A voltage of 3.3 volts is applied to Ihe pin and a curreni 
measurement is done with ihe tester's precision measuring 
unit. Parts with excessive leakage current are discarded. 
The implementation of this and other phase-locked loop 
tests has resulted in a !.)8% reduction in the line failure rate 
due to the phase-locked loop. 

Charge Pump Tests. Another circuit nm adequately tested 
in ihe original phase-locked loop was the charge pump. To 
remedy this a new test was developed to ensure that the 
charge pump was able to source and sink current within ils 
design specificalions. To allow for this test, new circuitry 
was added to disable either the M or \ counter in Ihe digilal 



lest mode. In Ihe test, the chip is put into the digital test 
mode with the N counter disabled. A clock is applied to the 
M counter and the M counter is allowed to count until three 
pulses have occurred on its output. At this point, the UP signal 
will be high and the DOWN signal will be low. The dock is 
slopped at this point, a voltage of 1.8V is applied to an ana- 
log test pin at the output of the loop filter, and the current 
sourced by the charge pump is measured and checked 
against the expected current. The same test is repeated, this 
time disabling the M counter and allowing the N counter to 
count until three pulses have occurred on its output. At this 
point, Ihe DOWN signal is high and the UP signal is low. A volt- 
age of 1.3V is applied to the analog test poini and the current 
drawn In the charge pump is measured. 

Summary 

We have greatly streamlined the procedure for designing, 
integrating, and testing a new phase-locked loop. In many 
cases, changes to the phase-locked loop can be made by 
resynthesizing and rerouting the digilal block and then per- 
forming a stability analysis. The customer has access to a 
detailed model of the phase-locked loop in either VHDL or 
Veiilog. and can use this to test the proper connection of the 
phase-locked loop and Ihe surrounding system. Production 
tests can be debugged with simulation before they are put 
onto a tester. Finally, Standard phase-locked loop tests have 
been written that run on OUT digital tC tester. These tests 
increase the test coverage for the phase-locked loop, in- 
creasing the quality of products shipped with these phase- 
locked loops in them. 
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Analog Behavioral Modeling and 
Mixed-Mode Simulation with SABER 
and Verilog 

A description is given of specific analog behavioral modeling and 
mixed-mode simulation techniques using SABER and Verilog. Full-channel 
simulations have been carried out on a class I partial response maximum 
likelihood (PRML) read/write channel chip. Complex analog circuits such 
as an adaptive feed-forward equalizer, an automatic gain control block, 
and a phase-locked loop are modeled in detail with the SABER MAST 
mixed-signal behavioral modeling language. A simulation speedup of two 
orders of magnitude has been achieved compared to SPICE. 

by Ben B. Sheng. Hugh S.C. Wallace, and James S. Ignovvski 



For more iluui two decades, the analog IC design community 
has been relying on variations of the original Berkeley SPICE, 
introduced in the 1970s, as the simulation lool for verifying 
and fine-tuning analog designs. Over Ihe years, many en- 
hancements have been put into these differeni flavors of 
SPICE, while increasingly more powerful computers have 
been used for running these circuit simulations. However, 
SPICK remains a low-level circuit simulator. II produces 
accurate results, but is inherently slow. Today's analog and 
mixed-mode designs are becoming increasingly complex. 
Functional simulations for larger mixed-signal designs are 
impractical with SPICE. Meanwhile, as Ihe pressure in- 
creases for low-cost, high-integration ASICs ("systems on 
a chip"), many analog functions are being integrated into 
largely digital chips. The need for new simulation methodol- 
ogies is becoming more urgent. 

In recent years, benefits from using analog and mixed-mode 
behavioral modeling languages have received increased 
recognition. The basic approach is to use a SPICE-like 
continuous-time simulator, which provides good accuracy 
in simulations, together with a fast digital simulator to give 
orders of magnitude faster digital circuit simulations. The 
modeling language is flexible so that designers can model 
analog subsystems in different levels of abstraction. The 
modeling language gives designers control over the trade-off 
between simulation speed and accuracy. 

This paper presents some of Ihe boitom-up modeling tech- 
niques and Simulation approaches that have been adopted 
during the process of modeling and simulating the read-write 
channel chip for an HP DDS-3 DAT drive. 

Analog Behavioral Modeling 

The idea of behavioral modeling is not new to analog design- 
ers. Macro models have been widely used by SPICE users. 
The newer-generation mixed-mode circuit simulators, such 
as SABER by Analogy, Inc. and SPECTRE by Cadence De- 
sign Systems, Inc.. have greatly enhanced designers' ability 



to model analog and mixed-mode circuits and systems by 
providing a flexible behavioral modeling language. With this 
modeling language, a designer can behaviorally describe an 
analog or mixed-mode device or subsystem at whatever level 
of abstraction is appropriate for a given simulation accuracy- 
versus-speed trade-off. One can use this modeling language 
lo write BSIM models for MOS transistors and use these 
BSIM models lo achieve simulation results that are as accu- 
rate as those from SPICE simulations. The same modeling 
language can be used to describe an analog-io-digilal con- 
verter (Apt ') behaviorally. without having In refer to any of 
its internal circuit elements. 

Several modeling approaches are discussed in this section. 
Based on Ihe scopes of these differeni approaches and their 
simulation speed-versus-accuracy Irade-offs, they can be 
categorized as eilher high-level, medium-level, or low-level 
modeling. In the following subsections, specific examples are 
given for high-level and medium-level modeling. Although 
low-level modeling is a very important part of analog model- 
ing and simulation, the techniques used lo do low-level mod- 
eling arc very similar tp those used in higher-lev el modeling. 
The only difference is that these techniques are used lo 
model much smaller devices, such as MOSFETs, diodes, and 
bipolar junction transistors. For brevity, discussion of low- 
level modeling is omitted. 

High-I.evel Modeling 

High-level modeling refers to behavioral models thai describe 
large analog and mixed-mode subsystems in a high level of 
abstraction. This approach provides the fastest simulation 
speed but Ihe least detail in Ihe circuits lhal are modeled. 

An ADC can be modeled with a clock input signal that trig- 
gers each conversion, an analog input signal, an analog ref- 
erence signal, and digital outputs. In addition to ihis basic 

structure, some realistic behavior can be included in the 

model. For example, Ihe model can include characteristics 
such as differential uonlinearily, integral nonlinearily, and 
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1 template adc elk In ref b2 bl bO = tau, td, dnl, Inl, vmax 
{ 

# Capacitive input load of 1 pF 
c.input_load in 0 = lp 
5 # Compute a random integral nonlinearity 
number random_inl = ( 2 *rand ( ) - 1 ) * inl 
when (event_on( elk, clk_last) ) { 

# Looking for rising edges in elk. 
if (elk == 14_1 & clk_last == 14_0) { 
10 schedule_event (time, b2, 14_x) 

schedule_event (time, bl, 14_x) 
schedule_event (time, bO, 14_x) 

# Tell the analog simulator to step on the clock edge to get accurate 

# analog signal value 
15 schedule_next_t ime (time) 

# Sample input voltage (with some adjustment for correct zeroing) 

# and reference voltage 
vref ■ v(ref) 

vin = v(in) + vref/ 8 
20 # Error checking 

if (vref < 0) error ("The voltage reference to the ADC is negative") 
if(abslvin) > vmax) error ("The ADC input signal is out of range") 

# Determine the sign bit 
if (vin < 0) d2 = 0 

25 else d2 =1 

# Add random differential nonlinearity 
vin = abs(vin) + ( 2*rand ( ) -1) »dnl 

# Clipping the output 

if (vin > vref) vin - vref 
30 # Add random integral nonlinearity 

vin = vin ♦ random_inl* ( vref - abs (vin) ) *abs (vin) *4/vref /vref 
if (vin < vref/2) dl = 0 
else { 
dl = 1 

35 vin = vin - vref/2 

) 

if (vin < vref/4) dO = 0 
else { 
dO = 1 

40 vin = vin - vref/4 

} 

# Compute resolution time. If td+t_resolve > 1-clock-cycle, then the ADC is 

# in a metastable state (a conversion error occurs) 
t_resolve = tau*ln (2/ (abs ( vin) +lu) ) 

45 if (d2 == 1) schedule_event(time+td+t_resolve, b2, 14_1) 

else schedule_event (time+td+t_resolve, b2, 14_0) 
if (dl == 1) schedule_event (time+td+t_resolve, bl, 14_1) 
else schedule_event ( tirae+td+t_resolve, bl, 14_0) 
if (dO == 1) schedule_event (time*td+t_resolve, bO, 14_1) 

50 else schedule_event (time+td+t_resolve, bO, 14_0) 



Fig. 1. A behavioral 3-bit ADC model written in the SABER MAST modeling language. 



metastability characteristics. A behavioral 3-bit ADC model 
written in the SABER MAST modeling language is shown in 
Fig. 1. 

As can be seen in this example, high-level modeling can be 
used in describing analog and mixed-mode subsystems, with 
some detail included. This particular approach is suitable 
for functional simulations of large systems. A large number 



of functional simulations can be carried out quickly, but cir- 
cuit details are often omitted. This type of high-level model- 
ing can speed up simulations by at least three orders of mag- 
nitude compared to SPICE, at the cost of not being able to 
simulate the fine details in the circuits. 
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For complex mixed-signal designs, chip-level connectivity 
verification is often a problem, since neither traditional ana- 
log simulators such as SPICE nor pure digital simulators 
such as Verilog can. for example, check voltage reference 
levels or common-mode levels of differential signals. One of 
the most important benefits of using high-level models is the 
ability to verify top-level circuit connectivity when the final 
chip is composed. One specific example is given in the ADC 
model of Fig. L Line 21 checks that the reference voltage to 
the ADC is not connected incorrectly (negative reference). 

Another key point is the ability to do analog assertion. 
Traditional graphical analog postprocessors work well if 
there are a manageable number of signals and time windows 
to look al — in other words, when dealing with simulation 
results of a relatively small circuit. For system-level simula- 
tions, in which multiple complex analog blocks interact with 
each other and hundreds of signals are changing constantly, 
it becomes very difficult to track all the important signals to 
make sure that all Ihe circuits are operating within their 
specified parameters. The analog modeling language allows 
designers to put specific assertions in the models to monitor 
the analog signals and give warning messages or even abort 
simulations when the model is operating outside of a set of 
prespecified parameters. An example of such an assertion 
can be seen in line 22 of Fig. 1, where the input signal is 
compared to a prespecified maximum level. 

The ADC model not only evaluates signals in the analog do- 
main, but also schedules digital events. As mentioned earlier, 
the newer-generation simulators not only provide SPICE-like 
analog simulation engines, but also have built-in digital sim- 
ulators, which give them much improved performance in 
simulating mixed-mode systems. Built-in digital simulators 
are orders of magnitude faster in simulating digital circuits 
than a SPICE-like simulator. We will discuss mixed-mode 
simulations in more detail later. 

Medium-Level Modeling 

Medium-level modeling refers to behavioral modeling of 
smaller building blocks, such as an operational amplifier 
(op amp), a multiplier, an integrator, or a comparator. These 
circuits typically contain a few dozen transistors. Models for 
these circuits can have more physical characteristics, which 
track SPICE simulation results. 

For modeling the read/write channel chip, we used medium- 
level models most extensively. Three different techniques 
were used to develop the models, based on the different 
circuit structures. 

The first approach is to create generic building blocks with 
flexible parameters that can be used to customize these ge- 
neric models when they are used in different applications. A 
good example for this approach is the model for an op amp. 
Characteristics of a generic op amp include dc gain, pole 
location (if a second pole location is Important, one can sim- 
ply connect two one-pole models in series), zero location, 
input impedance, input capacitive load, output impedance, 
output capacitance, output common-mode level, slew rate, 
output swing, and nonlinear distortion. More comprehensive 
characteristics can be added to this list when needed. Fig. 2 
shows an example of such a generic op amp model. Some of 
the parameters listed above are omitted for brevity. 



This type of generic circuit model can be made flexible 
enough to satisfy a variety of modeling and simulation 
needs. Additional characterization can be included, when 
appropriate, at relatively low simulation speed cost. 

In some cases, when the system performance is highly sensi- 
tive to the modeling accuracy of a small block, a curve-fitting 
approach can be used. This approach is suitable for small 
circuits with few" input ports. Outputs of a circuit can be 
empirically fit to carefully chosen mathematical functions. 
These models are fully customized for the given circuits. 
They can be made very accurate in representing a few key- 
characteristics of some small blocks, but they are not reus- 
able for modeling other circuits, and can become very com- 
plex when more input/output signals are added to the equa- 
tions. Designers who take this approach must be careful to 
produce a model that covers the entire input signal range to 
prevent the models from producing erroneous simulation 
results. 

The third approach taken in modeling circuits in the read/ 
write channel chip is to use the first-order MOSFET equa- 
tions to approximate circuit behaviors. A good example is a 
MOSFET switch. The on-resistance of a MOSFET switch can 
be approximated with the first-order equations accurately 
enough for most applications. The switches are effectively 
modeled as nonlinear resistors. If a more accurate model is 
needed, the curve-fitting method can be used. 

Mixed-Mode Simulations 

SPICE is an integration-based simulator. Although it gives 
good simulation accuracy, it is inherently slow, since it has 
to set up and solve a nonlinear system matrix for the entire 
circuit at every integration time point. The integration time 
step is determined by the largest signal change in the circuit, 
regardless of whether it is a digital signal or an analog signal. 
It is impractical to do system-level simulation with such an 
algorithm when the system has more than a few thousand 
transistors. 

To run system-level simulations on a complex mixed-signal 
design and get reasonable accuracy, it is necessary to use an 
event-driven digital simulator to simulate the digital portion 
of the design, together with an integration-based analog sim- 
ulator to simulate the analog circuits. The newer-generation 
mixed-mode simulators provide designers with both digital 
;md analog simulation engines in one simulator. Continuous- 
time analog signals are computed by the analog simulator, 
while the digital simulator evaluates the event queue. As 
illustrated by the ADC example, designers can specify the 
interactions bet ween the digital and analog domains. These 
interactions are resolved with the relaxation method. 1 The 
computational overhead for resolving these interactions is 
minimal, since the interactions are by nature discrete events. 
In addition. Ihe built-in digital simulators make it easy for 
the mixed-mode simulators to cosimulate wit h widely used 
external digital simulators, such as Cadence's Verilog-XL. 
Since the mixed-mode simulators already create digital 
event queues, they can share their event queues with an 
external digital simulator's event queue, and vice versa. The 
ability to cosimulate with well-established digital simulators 
makes it possible to use all of the available digital libraries, 
which are often built based on careful characterizations and 
can provide accurate simulation results. 



© Copr. 1949-1998 Hewlett-Packard Co. 



April 1987 Hewlett-Packard Jounal 91 



1 template opamp lnp inm outp outm van vdd gnd pd ■ ci, gm, gm3, vswing, ro, co, rz, imax 
{ 

c.cip inp 0 = ci 
c.cim inm 0 = ci 
5 values { 

# Define the block's inputs 
vint = v( inp, inm) 

vinp = v(inp) 
vinm = v(inm) 
10 if (v(inm) > v(inp)) { 

vinp = v(inm) 

vinm = v(inp) 

} 

# Gain and 3rd order nonlinear distortion 
15 idiff a gm*vint + gm3*vint**3 

# Output swing, the output clips at "vswing" 
iclip_up = {vswing - v (outp, outm) ) /ro/lk. 
iclip_dn = (vswing - v(outm,outp) ) /ro/lk 

if (v(outp,outm) > vswing) { 
20 iclip_up = iclip_up*10meg 

) 

else if (v (outp, outm) < -vswing) { 
iclip_dn = iclip_dn*10meg 

) 

25 idiff = idiff + iclip_up - iclip_dn; 

# Slew rate 

if (idiff > imax) idiff = imax 
if (pd -= 14_0) idiff = 0 

) 

30 equations { 

# A pole and a zero 

icapp = d_by_dt ( ( v ( outp ) - icapp*rz ) *co) 
icapm = d_by_dt ( (v(outm) - icapm*rz ) *co) 
i(outp) -= ioutp 
35 ioutp: ioutp = idiff - v(outp, vcm) /ro - icapp 

i(outm) - = ioutm 

ioutm: ioutm ■ -idiff - v (outm, vcm) /ro - icapm 

} 

} 



Fig. 2. An example of a generic op amp model. 

Fig. 3 shows the functional block diagram of the read/write 
channel chip. The shaded blocks indicate analog functions 
that were modeled with a behavioral analog modeling lan- 
guage. The rest of the chip (approximately 40%) is digital, 
including all of the system timing and control functions, a 
maximum-likelihood sequence detector (MLSD or Viterbi 
detector), anil much of the phase-locked loop. These digital 
circuits were designed with Yerilog HDL and synthesized 
with Synopsys using HP CMOS34 ( 1.0-|im CMOS) standard 
cells. For the final system-level simulations, the digital cir- 
cuits were simulated with Verilog and behavioral models of 
the analog circuits were simulated with SABER. Special 
interface models were built to establish connections be- 
tween the two simulators. 

System-Level Simulation Example: Read/Write IC 

The read/write channel chip is a mixed-signal IC with high 
analog circuit content. Approximately 60% of the core area 
is analog in nature. Because of the complexity and size of 
the analog circuits, many typical design considerations be- 
came more challenging. Simulating the interfaces and inter- 
actions between analog blocks as well as between the ana- 
log blocks and their digital control blocks w as critical for a 



successful design. Translation of system-level specifications 
into meaningful circuit block design specifications was diffi- 
cult, and probably most important of all, managing the sys- 
tem-wide budget of circuit nonideal behaviors was a formi- 
dable task. Ultimately our IC design team, working closely 
with the system design team from HP's Computer Peripherals 
Bristol Division, developed a simulation strategy of linking 
and correlating the simulation results of several tools to 
provide a closed loop of circuit and system verification. 

In the initial step of the design process, the customer speci- 
fied circuit performance by building a system model using C 
and Matheinatica that w as partitioned in a fashion consistent 
with the circuit partition. Much of the read/write channel 
chip's analog signal path was based on an analog PRML 
(partial response maximum likelihood ) read channel chip 
that was developed for disk drive applications, so there was 
some previous work that was leveraged in architecting the 
read/write channel chip. As functional descriptions of circuit 
operation were put into the system model, specific limits 
and design goals could be identified. This information drove 
the design of the analog circuit blocks, which were then 
simulated in SPICE. The detailed results of the individual 
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block simulations were incorporated back into the system 
model and real-world, nonideal circuit behaviors, such as 
offsets, nonlinearity, and PVT (process, voltage, and temper- 
ature) dependencies, were introduced into the system model. 
This increased level of detail in the system model led to very 
clear and well-defined specifications for the analog circuit 
blocks. This linking of system performance to circuit block 
performance allowed us to determine overall channel peri 
formance and define criteria for deciding when the design 
had acceptable margin and was ready for tape release. 

In parallel with the system simulations, high-level simulations 
using SABER and Verilog were being run with analog block 
models. The high-level simulations performed functional 
and timing checks of the digital control blocks and their 
interfaces to the behavioral representations of the analog 
blocks. In this way, the operation of analog circuitry under 
the control of complex state machines was verified. The 
results of high-level simulations of the analog portion of the 
system were compared to the results of the Mathematica 
system model for verification. Analysis and debugging of 
the analog behavioral simulation results were valuable for 
modeling and for confirming observations from the test and 
characterization of first silicon. 

Critical Analysis 

The read/write channel chip was our first major chip design 
for which high-level system design and verification were 
extremely critical to the chip's performance. Many previous 
mixed-signal designs depended on just the individual func- 
tional blocks' meeting the specifications to ensure that the 
final chip would perform to the system specifications. For 
the read/write channel chip, the system performance and 
trade-offs could only be evaluated after much of the actual 
circuit design was completed. The system performance of 
the read/write channel chip was extremely sensitive to a 



Fig. 3. Functional block diagram 
of the read/vvrite channel chip. 
The shaded blocks indicate ana- 
log functions that were modeled 
with a behavioral analog model- 
ing language. 

number of nonidealities in the signal path, so multiple itera- 
tions were carried out between careful circuit characteriza- 
tions and system performance evaluations. 

After the project was finished, and with an increasing need 
to define formally a methodology for new mixed-mode sig- 
nal processing chips, the strengths and weaknesses of the 
read/write channel chip methodology were evaluated. 

The high-level SABER models verified that the digital control 
circuitry functioned correctly with the analog blocks. This 
contribution cannot be overstated. The number of signals 
together with the complex interaction between the digital 
and analog blocks cannot be checked adequately in any other 
manner. The behavioral modeling of the analog blocks also 
discovered problems within the analog circuitry. While many 
of the main related analog blocks were simulated in SPICE 
together, there were others that were not, because of size and 
speed limitations in SPICE. The netlist extraction for the top- 
level SABER simulations was automatically generated from 
the chip transistor schematic, so the high-level simulations 
gave good confidence that the chip was correctly connected. 

The main weakness in the process was that one engineer 
was responsible for developing all the analog behavioral 
models. This was because of manpower constraints in this 
project and the new introduction of the SABER tools. The 
investment in the learning curve for the new tools could not 
be absorbed by this project. The process that was followed 
to develop more than 100 behavioral models was examina- 
tion of the schematics and SPICE plots that were available, 
and communication with the block designers. This worked 
fairly well in most cases, but there were some instances of 
incorrect behavioral modeling. The main problem was that 
human interpretation was needed to create the behavioral 
models. Characterization tests to compare the circuits with 
the models could have helped, but with so many blocks, 
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some being continuously modified, a full sol of characteriza- 
tion tests was not practical with the given amount of time 
and resources. 

Enhanced Methodology 

An enhanced methodology based on the one described 
above has been developed. This new methodology is in- 
tended to eliminate the weaknesses described in the read/ 
write channel chip mixed-mode methodology by leveraging 
as much as possible from the behavioral synthesis method- 
ology used in digital VLSI designs. The key is to remove as 
much human interpretation as possible and use the test vec- 
tor set as a common link from customer definition to final 
chip simulation. Since not all the specifications in a mixed- 
mode design can be encoded with traditional digital test 
vectors, many of the analog specifications can only be evalu- 
ated by specific analog tests. The key to these tests being 
successful is to develop them from the customer's written 
specifications or the circuit design engineers - knowledge 
when the behavioral model is developed. The intention is 
thai the block designer will develop the behavioral model. 
By taking this approach and developing tests that will run 
on both I he real schematic and its behavioral model, a set 
Of characterization tests can be exercised on the behavioral 
model. By sianing a block design with a behavioral model 
based on written specifications and a top-level customer 
model, a more speoificat ion-driven evaluation of each cir- 
cuit block can be used to determine whether the circuit will 
perform its desired function. This makes it possible to delay 
the exact circuit implementation until the full requirements 



of the block are understood. It could, for example, make the 
difference in deciding whether a block should be designed 
in the analog or the digital domain. 

The approach of having the block designers develop the 

behavioral models still requires human interpretation and 

complete understanding of the circuit. Although Ibis ap- 
proach is an Improvement of the methodology, the human 
interpretation is still a weakness. A number of simulators 
now gaining wide acceptance in VLSI digital design allow 
the whole chip to be converted Into a FET-level simulation 
The simulation speed is much faster ( 10 x ) than a SPICE 
simulation, with apparently improved convergence. The 
improvement in speed is obtained by running many parts of 
the circuit linearly. This approach compromises accuracy 
somewhat. The key to this new way to simulate mixed-mode 
systems is to define which blocks need to be simulated in a 
Sl'K'E-like mode to ensure the required accuracy. This whole- 
chip simulation can only be performed when a complete FET 
schematic is available, so it is seen as a final verification 
check using the main mixed-mode simulation vector set. 
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Physical Design of 0.35-|Lim Gate 
Arrays for Symmetric Multiprocessing 
Servers 

To meet gate density and system performance requirements for the 
HP Exemplar S-class and X-class technical servers, a physical design 
methodology was developed for 1.1 -million-raw-basic-cell. 0.35-Lim 
CMOS gate arrays. Commercial and ASIC vendor-supplied tools were 
augmented with internally developed tools to put together a highly 
optimized physical chip design process. 

by Lionel C. Bening, Tony M. Brewer, Harry D. Foster, Jeffrey S. Quigley, 
Robert A. Sussman, Paul F. Vbgel, and Aaron W. Wells 



This article provides a broad overview of the gate design 
and layout of 0.35-um gate arrays for the new S-class and 
X-class members of the Hewlett-Packard Exemplar techni- 
cal server family. The design process was built around third- 
party tools, but several internally developed tools were 
needed because commercial offerings were insufficient or 
unavailable. Generally, these tools automated the generation 
of the design or dramatically improved the logistics of the 
flow. Without these internally developed tools, meeting den- 
sity and speed objectives (the optimization of the design) 
would not have been possible in acceptable Calendar time 
with the design staff available 

In-house place-and-routc tools played a key role in this de- 
sign flow, As in previous projects, a close working relation- 
ship was developed with the gate array vendor. Among other 
benefits, this allowed the design staff to use the GARDS 
placement and routing software from SVR [formerly Silvar 
Lisco). Most of the internally developed tools written for 
the project operate directly on place-and-route information. 
Examples include a floor planner that understands design- 
specific routing obstructions, custom clock tree generation, 
and placement-based resynthesis of the gates. 

Server Architecture 

The target servers are symmetric multiprocessing systems 
built around the HP PA 8000 processor. S-class maciunes 
(also called nodes) connect 16 processors together to form 
a single system. X-class machines connect multiple nodes 
together to form a single system. X-class hardware can 
create a system containing up to 120 nodes (1920 proces- 
sors). The PA 8000s will initially run at 180 MHz. with the 
rest of the system running at 120 MHz. Except for the PA 
8000 and associated SRAMs and DRAMs, the bulk of the 
system logic is implemented in Fujitsu CGG1 0.35-um gate 
arrays, ;is shown in Table I. One additional gate array is im- 
plemented in the much less expensive CG51 0.5-UJtl process. 
This chip shared all the same tools and design flow as the 
0.35-um gate arrays. 



Table I 

Gate Arrays for S-Class and X-Class Servers 



Chip Name 


All Logic 
(Basic 
Cells) 


Random 
Logic 
(Basic 
Cells) 


Latch 
Array Bits 


Base 
Type 


Processor 
Interface 


550 k 


317 k 


28 k 


0.35 am 


Crossbar 


500 k 


206 k 


29 k 


0.35 urn 


Memory 
Interface 


570 k 


358 k 


25 k 


0,35 um 


Node-to-Node 
Interface 


300 k 


175 k 


43 k 


0.35 itm 


I/O Interface 


150 k 


106 k 


6.4 k 


0.5 tun 



The characteristics of the Fujitsu CG61 0.35-um CMOS gale 
array are as follows: 

• 1.1 million raw basic cells. One basic cell can implement 
one low-power, two-input NOR gate or one medium-power 
inverter. 

• Overall die size: 13.5 by 13.5 mm. Core size: 11.7 by 11.7 mm. 

• 4 routing layers. 

• 1.75-um routing pitch on metal layers I, 2, and 3; 14-uni on 
metal layer 4. 

• 560 I/O signals. 

• Flip-chip connection to package. Signals attach at periphery 
Power and ground attach on a uniform grid across the die. 

• Macros* generally had several functionally equivalent ver- 
sions with different drive strengths. 

• Target 45% utilization of raw basic cells in random logic- 
regions. 

• Many paths with 14 macros between registers at 120 MHz. 

• In this content, a macro Is a collection ol basic cells that implement a primitive function like a 
multiplexer or register 
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Fig. 1. Simplified physical design (low. 



Custom I/O macros were developed to meet the stringent 
system electrical and timing requirements. The board de- 
signers analyzed all system paths with SPICE. SPICE models 
of the board nets included gate array driver and receiver 
macros. 

The static flow is depicted in Fig. 1. The static flow does not 
reflect the actual day-to-day activity of the design process. 



Many iterations and subiterations were made until a chip 
could flow cleanly without timing or routing problems from 
start to finish. In addition to feeding back problems into the 
flow — for example, to update Synopsys constraints or make 
behavioral model changes — the design staff refined tools 
and techniques along the way. Rapid iterations through the 
place-and-route flow were vital to this process. 
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Typically, four designers worked on the larger arrays. One 
person worked almost exclusively on the floor plan, layout, 
and routing. The other designers, in addition to all other 
design duties, iterated through Synopsys and floor plan 
timing. Four full-time CAD engineers built and supported 
the flow. 

RTL Behavioral Model Styles 

The S-class and X-class server gate arrays were described 
in KTL-level Verilog (RTL stands for Register Transfer Lan- 
guage i. These models were considered to be the "golden" 
models and were maintained throughout the life of the proj- 
ect. Design verification used the RTL models exclusively. 
The Boolean equivalence tool (lover i guaranteed (hat the 
RTL Verilog and gate Verilog were equivalent. The RTL 
Verilog style varied depending on the tools and the tasks 
for which il was written. Thus, the style was driven by the 
needs of Synopsys. the Boolean equivalence tool, and high- 
speed RTL Verilog simulation. 

Style Driven by Synopsys. Synopsys was the primary tool to 
map RTL behavioral models to gates. Anyone familiar with 
this tool knows the requirements il imposes on the coding 
style: many small modules with lots of hierarchy, instantiated 
gates in the RTL. and code almost ai the equation level The 
project had two methods of instantiating gates in the RTL: 

• Macro Group Specification. Based on previous projects' 
difficulties in synthesizing multiplexer macros, all scan reg- 
isters and multiplexers were explicitly specified in the RTL. 
To make the RTL more readable and to improve RTL simula- 
tion speed, groups of macros were bundled under one level 
of hierarc hy. For instance, 32 high-drive 2-to-l multiplexer 
macros would appear in the RTL as: 

MUX21__HI_32 muxes ( 

.S (<32 bit port connection)) , 
.10 (<32 bit port oonnection> ) , 
.11 (<32 bit port connection) ) . 
.0 (<32 bit port connection)) 

1/ 

An internally developed tool read the RTL Verilog and gener- 
ated the corresponding modules' definitions based on a ven- 
dor-specific mapping of the generic type to the actual type. 

• 'ifdef GATES. Frequently, designers wanted to specify explicit 
macro types for some lines of code in a synthesized RTL 
module. Hie module would be written with: 

■ifdef GATES 

<macro instance specifications) 
'else 

<behavioral of the above gates) 
' endif 

For example: 

'ifdef GATES 
wire tl, t2, t3; 

X0R3 xortreel (tl, in<0>, in<l), in<2)); 

X0R3 xortree2 (t2, in<3), in<4>, in<5>); 

X0R3 xortree3 (t3, in<6>, in<7), in<8>); 

X0R3 xortree4 (perr, tl, t2, t3); 
'else 

perr = A in; 
'end 

The GATES lexl macro was defined when synthesizing. This 
prescivcd clarity and speed of simulation in the RTL while 



at the same time generating the correct gate design in Syn- 
opsys. The Boolean equivalence tool ensured that the gate 
descriptions matched the behavioral descriptions. 

Style Driven by Boolean Equivalence Verification. The Boolean 
equivalence tool (see "Formal Verification — Boolean Equiva- 
lence" below) logically compared the RTL behavioral model 
and the gate level netlist. This tool partitioned the design 
into cones of logic based on equivalence points, which are 
essentially net names that exist identically in both models. 
At a mimmum. all register outputs are equivalence points. 
To keep the comparison tool simple, each equivalence point 
had to match 1-to-l between the models. This simplification 
required that registers replicated for fanout purposes in the 
gate netlist be replicated identically in the RTL Verilog. 

Style Driven by Simulation Performance. Simulation perfor- 
mance was an important factor in each gate array's design 
quality and its development schedule. Faster simulation 
meant that we could get to a functionally correct gate array 
physical design more quickly. Thus, some of the RTL style 
methodology was driven by simulation performance. 

For all of the hand-instantiated miscellaneous logic prefixed 
by 'ifdef GATES, the designers included (he corresponding 
behavioral statement(s) bracketed with else ... endif. Also, 
guidelines were dev eloped that pointed designers towards 
behavioral Verilog code constructs that simulate faster, and 
a liming tool was developed that pointed out where they 
could use these faster constructs in their Verilog. F.xamples 
included: 

• Use of a mask plus a unary operator in place of the corre- 
sponding binary logic operation on selected bits of a bus, as 
in error correction bit calculations 

• Use of a temporary variable in place of frequently referenced 
bit positions 

• Concatenation in place of subrange assignments and shifts 
by constants. 

In addition to simulating with vendor Verilog simulators, an 
in-liouse Verilog-to-C translation tool was developed to gen- 
erate cycle-based models for each gale array type. The 
resulting simulations ran five to twelve times faster than the 
same simulations on vendor simulators.' 

Because the Verilog language is extensive and expensive to 
support in its entirety, and because the internally developed 
tools were never intended for wide use beyond the bound- 
aries Of the project, the cycle-based Verilog-lo-C translation 
tools supported only a subset of behavioral Verilog that, is 
Sufficient f" r cycle-based simulation. Where there were sev- 
eral different ways of describing the same design in Verilog, 
the tools supported the style that was prevalent among de- 
signers and less expensive to parse. Time constraints forced 
tool development to leave out constant expressions, loops, 
parameters, and variables as bit subscripts from the sup- 
ported behavioral Verilog language style. 

Latch Array Compiler 

The latch array compiler allowed the designers to create 
new configurations of memory blocks easily. The memory 
blocks were implemented as a row of latches per dala bit , 
With a column of latches selected by a read or write address. 
All latch arrays had a dedicated write port and either one or 
two dedicated read polls. All address, dala in, and dala out 
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polls of llie latch array had registers to isolate the internal 
timing of the latch array from the surrounding logic. Option- 
ally, parity going into or coming out of the latch array could 
be checked. The placement locations of the write anil read 
address ports were fixed, but the data input and output 
ports could be independently placed to the left or right of 
the core area. 

The compiler's output consisted of the RTL behavioral and 
gate Verilog models for the configurations, as well as the 
relative placement information. This placement information 
was then given to the floor planner so that the latch arrays 
could be relocated to any desired site. 

There were 16 to 41 latch arrays per design (118 total for 
four designs), with each latch array storing between -18 and 
21 12 bits, for an approximate total of 25K to -13K bits per 
chip, representing 35% to 60% of the basic cells used for 
each design. Through the use of custom macros designed 
specifically for the project, the core area of the latch arrays 
achieved over 87% utilization. For a single-read-port array, 
the core area consumed about 5.6 basic cells per bit, while 
the core area for a dual-read-port array used about 9.6 basic 
cells per bit. The core of the latch array was routed com- 
pletely in metal 1 and 2, allowing the metal 3 over the core 
area to be used for normal routing. 

Synopsys 

Synopsys was used as the primary means to map nonregister 
and nonmultiplexcr RTL to gates. The designers did not rely 
on Synopsys to generate the final gates. The designer's goal 
was just to get close to the final liming and logic area and 
then use the resynthesis step described later to improve the 
gates. There are two main reasons for this. First. Synopsys 
cannot read and optimize an entire design in a practical 
amount of time, and second, Synopsys cannot synthesize 
fanout trees based on the actual placement of macros. 

Constraints for Synopsys were generated by hand. 

Formal Verification — Boolean Equivalence 

Formal verification using an internally developed Boolean 
equivalence verification tool (lover) eliminated a tremendous 
amount of gate level simulation. In fact, the system simula- 
tion remained at the RTL level throughout the entire project. 
Boolean equivalence verification, unlike simulation, guaran- 
tees complete results, since it uses mathematical formal 
proof techniques to verify logical equivalence, lover ran in 
three fundamental modes: RTL-to-RTL. RTL-to-gate. and 
gate-to-gate. 

RTL-to-RTL comparison was critical when it became neces- 
sary to modify modules in the RTL to remove redundancies 
across module boundaries. Also, RTL-to-RTL comparison 
allowed the designers to explore different implementations 
of an equivalent design. 

RTL-to-gate comparison look a lot of pain away from doing 
hand-captured gates. Miscompares could be isolated down 
to a few gates and fixed quickly. Also, RTL-to-gate compari- 
son was useful as a revision control check b\ ensuring that 
any last-minute changes in the RTL source were synthesized 
into gates, or in other words, that design verification's RTL 
matched the actual netlist. 



Gate-to-gate comparisons ensured that the CAD system 
(resynthesis in particular) maintained logical correctness 
throughout I he How. In addition, gate-to-gate comparison 
was used to ensure that all hand edits performed during Ihc 
layout process were error-free. This allowed last minute 
changes to be made in the design with confidence. 

Maintaining a consistent Verilog naming convention through- 
out the CAD How facilitated the mapping of flat Verilog wire 
and register names to their hierarchical equivalents. This 
convention dramatically Improved the performance of lover 
by providing additional subequivaletice points. L'sing sub* 
equivalence points, large cones of logic whose boundaries 
consist of ports and registers can be broken up into smaller 
cones of logic automatically. 

A complete gate-to-gate verification of a 550,000-gate design 
ran on an SPP-1000 (an 8-way symmetric multiprocessing 
machine with 2G bytes of main memory) in under 20 minutes 
and required 1.2G bytes of physical memory (see Table II). 
A complete RTL-to-gate v erification of a 550,000-gate design 
was completed in 1 hour and required 1.2G bytes of memory. 
Most RTL-to-RTL subhierarchical Comparisons completed in 
under 2(1 seconds, lover reads and compiles the RTL source 
files, netlists, and libraries directly. These times represent a 
combination of the compilation and comparison processes. 



Table II 

Gate-to-Gate Boolean Equivalence Run-Time Performance 
and Memory Requirements 



Chip Name 

Processor 
Interface 

Crossbar 

Memory 
Interface 

Node-to-N'ode 
Interface 

IA ) Interface 



Logic Size 

(Basic Cells) Minutes Virtual Memory 



550 k 

500 k 
570 k 

300 k 

150 k 



20 

9 
20 

10 



I.2G byles 

0.9G bytes 
1.2G bytes 

LOG bytes 

0.3G bytes 



Floor Plan 

The floor plan tool (fpt) helped the designer generate (he 
placement for the major blocks of logic. The physical hier- 
archy of the design initially matched the logical hierarchy 
(a module in the source Verilog became a block for place- 
ment ). but by editing the placement definition file, the physi- 
cal hierarchy could be changed to anything the designer 
desired. 

fpt read the same placement obstructions as the gale place- 
ment tool, so an accurate determination of the utilization of 
each block could be made as it was placed. The tool also 
understood the concept of fixed-size blocks (such as the 
latch arrays) as opposed to the malleable blocks which were 
primarily composed of synthesized logic, and allowed the 
latter block types to be reshaped. 

Interconnect lines could be drawn between the block drivers 
and receivers, with the width of the drawn line representing 
the number of signals crossing the boundaries of the blocks. 
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Fig. 2. Processor interface gate array floor plan and placement views, (a) Floor plan tool (Ipt) view. The block name and effective per- 
centage utilization are shown. The effective utilization is based on the underlying obstructions used by the macro placement tools. Ob- 
structions are usually channelization, latch array, or I/O regions. The shaded boxes are latch arrays. Blocks can be overlapped. A density 
map can be displayed, which shows where overlapped areas result in density that may be too high to place efficiently. Floor plan blocks 
that, show similar effective utilization can actually be placed at different real utilization. For instance, csrjnput at 77% ( middle of die) and 
reqjlec, also at 77% (right side middle above queue jYi), show similar effective utilization. Compared to the actual placement, csrjnput is 
placed at lower real utilization because of the additional routing channels, (b) Placement view. In the placement view, the regions of 
custom channelization can be clearly seen. The latch array regions are also clearly visible. The actual size and number of the channels 
were determined by many iterations through the place-and-route flow, attempting to get a complete route Register and multiplexer 
macros clustered around the I/O sites were placed identically to control chip-to-chip skew. 



Inputs and outputs from a block could be drawn in separate 
colors to help illustrate the flow through the blocks. 

fpt could display a density map that quickly identified local 
hot spots of high placement density. There was also a mclhod 
to show information from a prior postrotite analysis so that 
the floor plan could be adjusted to provide more room for 
congested routing areas. 

Fig. 2a shows an example of a floor plan for one of the chips 
and Fig. 2b shows the placement view. 

Timing 

Tuning analysis was performed by an internally developed 
tool because a third-party tool with the necessary features 
and performance was not available and resynthesis (see 
below) required t ight coupling of the timing analysis to the 
automatic netlist editor. Each step in the flow (floor plan, 
placement, routing) used the same timing algorithm and 
timing tool. Differences in delays at each step resulted solely 
from the accuracy of the n-models and each macro input 
pin's T|i„ e (RC delay). (A n-model represents the net + gate 
load as seen by the macro output.) For different steps in the 
design flow, lite net jr-model and RC delays were calculated 
as follows: 



• Floor plan. Wire table lookup was used for sections of the 
net that were completely contained within a block. Sections 
that spanned blocks were psetidorouted. The composite was 
then reduced to the n-model and T|i„ e values. 

• Placement. A pseudoroute was constructed for each net and 
then reduced to the it-model and T\[ m , values. 

• Routing. A special route analysis program was developed to 
analyze all metal layers and generate the .t-model and Tn, IP 
values. 

Delay calculations were nonlinear and edge rate dependent . 
Presentation of edge rates in the critical path reports was 
very helpful in fixing slow paths. Slow edge rates were the 
first thing to look for (and fix) in a broken timing path. 

Two delay values and two edge rate values were calculated 
per macro level: 

• T KaIe is the input-pin-to-output-pin intrinsic macro delay. 
Tg a | P is a function of the timing arc through the macro, the 
input pin edge rate (TsfaO and the it-model on the macro out- 
put. There are several T gate calculations per macro, each 
corresponding to a different timing arc through the macro. 

• T S0 „| is the edge rate seen at the macro output pias. For each 
Tgaip calculation there was a matching T srmI calculation. 

• T| jnt . is the RC delay. Tu,„, is a function of the net topology- 
only. 
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Fig. 3. TimiuM comparison: floor plan versus placement. 

T sm is the edge rate as seen al the macro input pins. T sm is 
always greater than T soul of the drhing pin and represents 
the degradation of the signal edge as it propagates through 
the EC tree of the net. The edge rate degradation was a sim- 
ple function of T|j,„, and was not very aeciu'ate for compli- 
cated net topologies. 

Each gate array used tJie same simple clocking scheme. At 
most there were three different clock phases, which were 
generated from the same master clock pin of the chip. The 
clock phases are 1 x period, 2 x period (2a). and 2 x period 
with the opposite phase (2b). The simple clock scheme 
allows a simple forward traversal algorithm to sum the de- 
lays 'IV,,,,. and T|„„, io gel path liming. Macro output pin edge 
rates (TsouO were propagated and derated to the inpul pins 
(T sm ) during the same forward traversal. Typically, 1-»1. 
2a— »1, and 1— *2b paths were calculated in one pass. 1— *1. 
2b-»l, l-»2a paths were calculated in another pass, and 
2a-*2a and 2b-*2b paths were calculated in a third pass. 

Resynthesis 

Resynthesis can be thought of as an elaborate in-place opti- 
mization of the entire design. Resynthesis was used to re- 
duce or remove logic, build and place optimal fanout trees, 
and Change macro power levels to the minimum necessary 
to meet timing requirements. There were six steps in the 
resynthesis process: 

Flatten. The gate level Yerilog modules were read in. linked 
and flattened so that the entire design could be easily opti- 
mized across hierarchical boundaries. 
Rip/join. Buffers and inverter pairs were deleted, simple 
macros were joined together, logic was reduced through 
constant propagation, and unused logic was deleted. 
Coarse fix. Each macro output pin in the design was either 
powered up (i.e.. its drive strength was increased) or fanned 
out such that the output pin edge rate was below an arbi- 
trary value and the input pin edge rates on the net were all 
below a (different ) arbitrary value. This step also ensured 
that each macro output pin did not drive more than its maxi- 
mum capacitive load. 
Complete timing analysis. 

Fine pass. Once the coarse fix was made and macros were 
powered up to meet the edge rate specification, a topological 
traversal staining from the register inputs and working back- 
wards was made to further power up macros that were in 
broken critical paths. This step typically did not add more 



than 2000 basic cells to the design. Timing and pseudoroutes 
were updated after each macro power-up. The timing up- 
date algorithm retimed only those macros affected by the 
power-up. 

i Power down. The coarse fix step and Synopsys can over- 
power components. A topological traversal was made start- 
ing from register inputs and working backwards to power 
down those components in noncritical paths (i.e.. that had 
positive global slack) as long as the net loading rules were 
met. Timing and pseudoroutes were updated as the algorithm 

proceeded. 

Resynthesis was run at two different steps in the Bow. A 
first -pass resynthesis was done using floor plan net lengths 
and floor-plan-based approximate macro locations. This 
design was then placed and improved. The improved place- 
ment and net list from the first resynthesis were then fed 
back for a second complete resynthesis. (Fanout trees built 
during the first resynthesis were ripped out by the rip/join 
step.) The second resynthesis resulted in about 1000 fewer 
fanout buffers and 10.000 fewer basic cells used than the 
initial floor-plan-based resynthesis. 

Resynthesis and timing of the processor interface chip 
(317,000 random logic cells. —91,000 macros) typically took 
10 CPI 1 minutes on an IIP 9000 Model 735 workstation and 
consumed 320M bytes of virtual memory. 

Figs. 3 and 4 show the differences in the delays seen at reg- 
ister inputs between timing runs made at different steps in 
the flow. Fig. 3 shows the differences between floor plan 
timing after resynthesis and final placement timing. Values 
less than zero show floor plan timing that was conservative 
compared to the final placement timing. Values greater than 
zero show floor plan timing that was optimistic compared to 
final placement timing. 

Fig. 4 shows good, albeit conservative, correlation between 
the placement and route timing. Relatively few changes had 
to be made in the design once the designers had timing 
working at placement. The histograms clearly show that an 
attempt to "make timing" at the Synopsys/floor plan step is 
unnecessarily conservative. 

Placement 

The CARDS gate array suite of tools from Silicon Valley 
Research (SVR. formerly known as Silvar-Lisco), was chosen 
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as the central set of tools for placement of the server gate 
arrays. The choice of GARDS was based on a history of suc- 
cessful experience with these tools. A long-held philosophy 
in our laboratory has been to put the place-and-route tools 
in the hands of the designers, and to choose the tool that 
best allows the designer to parlay design knowledge into 
an intelligent placement. With only minor exceptions, the 
GARDS tools functioned capably and fulfilled the place-and- 
route needs for which they were chosen 

L'nlike many other design flows for 0.35-um gate arrays, 
hierarchy was flattened by the resynthesis step, and the 
placement tool had free rein to optimize placement across 
boundaries. A drawback of this approach is that major up- 
dates to a current placement (e.g.. synthesize one block and 
feed the changed block into the existing placement) were 
virtually impossible. In such cases, the placement process 
was restarted from the beginning. 

Complementary internal tools constructed all necessary 
files in GARDS format, such as placement obstruction files 
for opening extra routing channels and for inhibiting auto- 
matic placement in critical regions such as latch arrays. 

A valuable feature of the GARDS tool set is the C program- 
ming language database access routines known as GEARS. 
These made it easy for internal tools to extract design infor- 
mation to customize (he integration of GARDS into the tool 
flow. 

The GARDS libraries for placement and routing were crealed 
internally by translating Cadence LEF (Library Exchange 
Format) files provided by the chip vendor. The general 
placement scheme consisted of the following series of steps: 

1. Import of preplaced gates (1/Os, internal macros in the 
I/O-to-register or regis! er-to-I/O paths, and latch arrays) 

2. Import of placement obstructions (for channelization, 
latch array protection, and clock tree allocation) 

3. Min-cut placement of the imported floor plan. Min-cut 
placement is a recursive subdividing algorithm that seeks to 
minimize connections across the dividing line. The resulting 
placement has minimum wiring between macros. 

4. Run checkfit (internally developed tool). 

5. Unplace errantly placed components. 

(i. Min-cut placement of floor plan file containing only the 
errantly placed components. 

7. Run a series of walking-window improvement passes. 

8. Repeat steps 1 through 7 after a layout-based resynthesis. 

Steps 1 through 3 generally took about two hours for the 
larger designs, and could be automated through scripting. 
After the init ial read of the floor plan, it was necessary to 
check the integrity of placement adherence to the floor plan. 
An unfortunate feature of the placement tool was that if 
components could not be placed within their assigned floor 
plan bin, they would be placed according to the center of 
gra\ in of the connectivity, which usually was not acceptable. 

Step 1 involved an Internal tool, checkfif, which was developed 
to unplace errantly placed Components and to create a new 
reduced version of the floor plan bin file that could increase 
the original dimensions of the bins. Steps 5 through 7 could 



also be scripted, with step 7 consisting of a mix of net -length- 
based improvements and congestion-based improvements. 
Step 7 represents the largest investment ut CPU cycles, with 
a typical set of improvement passes taking about 24 hours. 

Clock Tree Builder 

Once placement was complete, the clock tree builder was 
used to generate different gated types of the master clock. 
The structure of the clock tree, which was identical for all 
CG61 arrays, was a uniform grid of taps driven from a central 
point (see Fig. 5). A tap was either an OR gate, which gener- 
ated some form of gated clock, or a buffer, which generated 
a free-running clock. Macro positions and most connections 
were predefined, with the only variables being the tap type 
I OR gate or buffer), the tap flavor ( which gating term drove 
the OR gate, if any), and the last-level connection to the 
registers. 

The clock tree builder removed taps obstructed by latch 
arrays, figured out a best initial assignment of taps associ- 
ated with a gating term register, and then minimized the last 
level of clock tree routing by optimizing the clock-load-to- 
tap assignments. The last level of clock tree routing was 
performed by the aulorouter. The dock tree builder pro- 
vided the user with the flexibility of manually selecting tap 
gating term assignments and tap load assignments. The com- 
plete clock tree consumed 1-5% to 2% of the die area. 

Skew 

Intrachip skew was calculated as the difference in delay 
between the fastest tap and the slowest tap. The maximum 
intrachip skew for these gate arrays totaled 400 ps. of which 
200 ps was actual metal and gate load variations and 200 ps 
was intrachip variance. 
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Fig. 5. Clock distribution tree. 
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Metal and gate load variations were mostly due to different 
numbers of loads and different route lengths in the last level 
of the tree. Dummy loads were added to the tips and center 
of the main clock rails after routing to balance the rails pre- 
cisely. Special control of the last-level loading and routing of 
I/O registers reduced the metal and gate load skew on those 
registers to 100 ps. 

The variance portion of skew included the voltage, tempera- 
ture, and process variations possible across the die. Variance 
also accounted for delay modeling uncertainty between OR 
gates and buffers in the last level of the tree. 

Scan Chain Hookup 

Once all the scan elements were placed, the scan chain was 
automatically connected. Each element had a property that 
identified the element's scan ring. The beginning and end of 
each ring had a property that marked these points as special. 
Buffers and pairs of inverters that were connected to the 
scan points were marked as equivalent to the scan point and 
could also be used. Each ring was stitched together using a 
"greedy" algorithm that chose the best element to add at 
each step. The best element was determined by assigning a 
penalty value to each available element and then choosing 
the one with the minimum penalty. The penalty value was 
a summation of four individual penalties: 
Distance. The distance between the element and the con- 
nection point. Reducing this value decreased the amount 
of routing needed to connect the ring. 
Clock. A penalty was assessed if there was a difference 
between the clock for the element and the clock for the 
connection point, that is. if the clocks came from different 
taps or rails. The more the clocks were similar, the less the 
chance for hold errors while scanning. 
Buffering. A penalty was assessed if the connection point 
was not buffered from the true scan point. Preferring the 
scan points that were buffered reduced the possibility of 
hold problems. 

Center crossing. The connection crossed the vertical center- 
line. The main clock distribution was in this area and hori- 
zontal routing was limited. Reducing the horizontal scan 
connections through this area simplified the routing for the 
design. 

Each of the above individual penalties had an associated 
weight so the relative importance of each could be adjusted. 

For the designs that had routing completion problems, a 
post-hookup optimization could be applied. This optimiza- 
tion only considered the distance between scan points as 
the objective to minimize and often resulted in over a meter 
of wire removed from the scan chain. 

After all of the scan rings were connected, the wires added 
at the gate level could also be inserted into the beha\ioral 
model. 

Routing 

Routing of the chips was accomplished with the GARDS 
route tools (garout/linesearch, garout/maze, redit/ripup-reroute). 
Normal signal routing occurred on metal layers L, 2. and 3, 
with only clock net routing on metal layer 4. The version of 
GARDS used by the server project did not include support 
for fourth-layer routing, so fixed metal 4 segments were 



specified and connected with internally developed post- 
processing tools dff2def and eclare (see below). 

The routing process began with certain special nets. These 
were mostly latch array nets and certain clock nets, which 
needed to be routed with the maze router (garout/maze). Then 
the line search router (garoui/lmesearch ) was used to route 
nearly all of the remaining connections, leaving only about 
1% to 1.5% failing to route. These were routed by the maze 
rouier. usually to within about 200 disconnects. At this point 
the design was taken into the redit tool, using the ripup-reroute 
feature of that tool. This normally completed the remaining 
connections. The whole routing process generally took one 
to two days to complete. 

Some special techniques were exploited to push the enve- 
lope of routing density. It was observed that metal layer 1 
was underutilized, and that some of the saturated metal 2 
routing could be forced down to metal 1. thus allowing ex- 
tended use of metal 2. This was done with an internally de- 
veloped tool that read the routing that had occurred up to a 
certain stage, determined candidates for moving metal 2 
horizontal segments to metal 1 horizontal segments (nonpre- 
ferred direction), and finally moving those segments. This 
technique helped complete the routing in critically dense 
areas. 

Another technique for achieving better routing completion 
was channelization (see Fig. 2). When a routing attempt 
failed to make all connections, additional routing channels 
were created in regions where concentrations of discon- 
nects were observed. The channel obstructions were fed 
back to the floor planner, where the designer could make 
sure that the effective utilization of each block was still 
below the safe limit. If necessary, blocks were enlarged. 

Postroute Analysis 

After placement and routing were complete, the capacitance 
and RC delays were extracted by an internally developed 
tool, eclare. eclare read the GARDS database to obtain the 
macro and routing information for the design. The Verilog 
gate level net list was also read so that the connectivity pres- 
ent in the GARDS database could be verified against the 
netlist connectivity, eclare then used the GARDS data to con- 
struct four arrays of grid points (one per metal level I that 
described the metal present at every grid location. The infor- 
mation stored at each grid point included metal width, type 
of metal, directions of connections, an index into a table of 
pin and net names, and a \isitcd flag. 

Storing this much information per grid point is expensive in 
terms of memory ( 720M bytes of storage is required lo hold 
the grid data for the current chips), but it greatly simplified 
the determination of neighbors during the extraction process. 
The project had access to several machines that had 1G 
bytes (or more) of physical memory, so the storage require- 
ment was not a serious impediment. This data structure also 
allowed the run time of the tool to be fairly short, eclare ran 
in less than 30 minutes, with about half of the time being file 
input/output. 

The extraction process began after all of the grid information 
was loaded. For every driver on each net. eclare traced all of 
the metal attached to the driving put. As the net was traced. 
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a list of nodes and segments was built for use in solving for 
the equivalent rt-model and RC delays. 

For each grid point traced, nearby grid locations were exam- 
ined to determine their contents. When all of the neighbor 
information was determined for the current grid point, a 
lookup table was consulted to obtain the capacitance value 
for this neighborhood configuration. This capacitance and 
the resistance for the current metal level were added to the 
appropriate entry in the segment list. The capacitance value 
was also added to a counter for each of the neighboring nets 
for use later when determining the coupling factor. 

When all of the net had been traced, the list of nodes and 
segments was used to solve for the RC delays to the input 
pins and the equivalent .t-model at the driver. All neighboring 
nets were examined to determine the maximum amount of 
capacitance that was coupled to the target net. If the maxi- 
mum coupled capacitance to any single neighboring net 
exceeded 20% of the total capacitance for the current net, 
the routing had to be adjusted to decrease the coupling. The 
coupling percentage was used to create a ininimum/maxi- 
mum range of capacitance values in the equivalent .i-model 
that was later used by the timing verifier. 

eclare performed a few other tasks in addition to the extrac- 
tion. The Fujitsu toolset checked for nets that failed their 
antenna rule (too much unterniinated metal connected to 
the gate of a transistor during fabrication), eclare also made 
this check, but earlier in the overall design flow. Optionally, 
eclare could output a distributed RC SPICE model for any of 
the nets in the design. Finally, eclare produced a File of clock 
rail loading that was used to determine the number of rail 
balance loads needed to equalize the total capacitance on 
each rail. 

Last Minute Changes 

Given the large latency from RTI, behavioral change through 
Synopsys and resynthesis to placed design ( usually three to 
four days), an interactive hreak editor was constructed to 
allow designers to edit the netlist and placement interactively. 
This X Windows tool displayed the placement, netlist, and 
timing information graphically and then allowed the designers 
to add, change, delete, and move parts and make other 
changes. Global chip timing information was updated in real 
time so that the effect of these netlist edits was instanta- 
neously viewable. 

The tweak editor gave the designers more than enough lati- 
tude to change the logical behavior of the gate design (and 
sometimes that was the exact intent). The new netlist was 
logically verified against the RTL behavioral model with 
lover, the Boolean equivalence verification tool. 

Vendor Interface 

The vendor interface consisted of a front end, where libraries 
of various formats were received from the vendor, and a 
back end, where ultimate tape-out-ready design files were 
sent to the vendor for go-to-make. 

' )n the front end, the vendor supplied all necessary libraries. 
Some of these were characterization data files in the vendor's 



internal format, and the rest were in various standard for- 
mats, such as Verilog and LEF Also provided was an inte- 
grated set of programs that were pieces of the vendor's own 
proprietary ASIC development package. These programs, 
which included the vendor's LCADFE and GLOSCAD pack- 
ages, also included library data. 

The physical libraries provided by the vendor were in the 
standard Cadence LEF format. An internally developed tool. 
le(2gards, was used to translate the LEF description of the 
base array and the macros into GARDS library formats. The 
Ief2gards translator had been used in previous projects, and 
only required some minor modifications to work with the 
vendor's CMOS LEF files. 

On the back end, another tool, drf2def. was developed to read 
the GARDS design file ( using GEARS access routines) and 
produce a logical netlist in the vendor's proprietary format. 
FLD, as well as a Cadence DEF (Design Exchange Format ) 
file, which contained all of the physical information such as 
placement and routing. The FLD file was imported into the 
LCADFE vendor package, where certain netlist expansions 
and design rule checks were done. Then the GLOSCAD 
package was used to import the LCADFE logical data and 
the DEF file physical data. This was followed by a run of the 
GLOSCAD laychk program, which did a thorough physical 
design rule check. Finally, the GLOSCAD pdilmake program 
produced the ;t-model and RC data, which was folded back 
into the LCADFE package for postlayoul liming simulation. 
A complete set of vector simulations was then run to verify 
both timing and logical aspects of the design. 

After determination of successful timing and clean LCADFE 
and GLOSCAD runs, and after passing a stringent tape-out 
review, the appropriate design files (essentially tar snapshots 
of the vendor's LCADFE and GLOSCAD user design tree) 
were transferred via ftp to the vendor's site. A go-to-make 
release usually followed within a few days of the original 
transfer. 

Conclusions 

Commercial synthesis and layout tools alone were not suffi- 
cient to achieve the ASIC performance, density, and time-to- 
market goals of the S- and X-class machines. A significant 
investment was made in both internally developed and com- 
mercial CAD tools to put together a highly optimized physical 
chip design process. The resynthesis process coupled with 
in-house place-and-route tools minimized direct human inter- 
action in optimizing (he synthesized designs and made it 
possible to achieve the project design goals, which would 
otherwise not have been possible. 
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Fast Turnaround of a Structured 
Custom IC Design Using Advanced 
Design Tools and Methodology 

Through the use of several new tools and methodologies, a small team 
of engineers was able to design and verify a 1.7-million-FET chip in eight 
months. The tools and methodologies used included a set of guidelines 
and timing constraints that were met by the customer, a data path 
compiler, a highly tuned custom multiplier cell that was used in 87 
locations, and an automated top-level power connection scheme. 

by Rory L. Fisher, Stephen R. Ilerbener, John R. Morgan, and John R. Pessetto 



The HP IMACC chip was developed to provide image pro- 
cessing capabilities. The iniiial larger application is medical 
imaging with geological applicalions as a potential area of 
expansion. The graphical capabilities of IMACC include 
spatial filtering, edge detection and enhancement, image pan 
and zoom, image rotation, and window and level control. 
IMACC consists of three major components: 
The convolver circuit has a 3 x 3 programmable kernel* and 
can perform low-pass or high-pass spatial filtering, edge 
enhancement, and other functions. 

The interpolator is an implementation of a 4 X 4 bicubic 
convolution kernel.* The interpolator can be configured 
to perform pan. zoom, and rotation. 

A RAM-based lookup table is used for windowing and level- 
ing of image pixel intensit ies. 

In support of the various user-selectable operating modes, 
any or all three of the functional blocks may be active at a 
time. The order of operations can be changed as desired, 
with the single limitation that the convolver must precede 
the interpolator if both modules are in the chain. When the 
image visualization accelerator board (IMACC is the heart of 
this board ) is attached to the IIP HCRX graphics subsystem, 
simultaneous convolution, zoom, rotation, and window" and 
level control of 1024-by-1024 pixel. 10-bit medical images at 
41) frames per second are possible. The accelerator can pro- 
cess more than 40 million pixels per second independent of 
the number or order of internal operations. 

Customer Interaction 

As a result of our experience in designing numerous ICs for 
various customers, our laboratory has developed some prac- 
tical, informal guidelines for designing ICs. At the beginning 
of the IMACC project, we met with the customer (another 
HP laboratory) and discussed these guidelines along with 
project goals. The guidelines we provided to our customer 
are as follows: 

A kernel is a lunctional unit thai can be repealed as needed A 3 x 3 programmable kernel 
performs a programmable function on a 3 x 3 array ot pixels A 4 x 4 bicubic convolution 
kernel performs a bicubic convolution on a 4 x 4 arrav of puels 



• The prime directive: Signal groups such as multisiale drivers 
on a single bus. multiple set signals into a flip-Hop, or multi- 
ple set signals into a multiplexer, may cause drive fights and 
therefore need to be completely decoded from the current 
state of the control machine so that one and only one will 
fire. This requirement must hold even if the chip comes up in 
a random state. Exceptions to this have caused significant 
delays in schedule right before tape release. 

• Signals require a consistent naming convention. 

• I "pdate flip-flops on the falling edge of the clock ( single-edge 
timing). 

• When glitchless values are required (Gray code counters, 
etc.), they must come directly out of flip-flops. 

• Resets are typically heavily loaded and will probably cause 
timing problems. Have each control block latch its own 
version of the chip reset, then generate its own local reset. 
This helps timing at the expense of latency. 

• Don't design multistate paths. Complete timing analysis of 
such a path is not possible in any design tool. 

• Don't set and dump the same FIFO or RAM location at the 
same time. 

• Keep Synopsys blocks small. 

• Keep large register tiles in the data path. 

• I 'se no clock uncertainty (skew ) in Synopsys. It will be 
there, but is better allowed for by reducing the period. 

• Don't allow Synopsys to try to fix hold problems. There 
should be none by design. 

• When setting your timing constraints, allow some slack for 
RC delays, the local clock generator, and incremental delays 
that will be introduced when actual routing capacitances 
are substituted into the timing model. For example. 15 ns 
might be a good period constraint at 60 MHz. 

• As constraints (timing and loading) become more accurate, 
make sure to continue to update them in your design. 
Accurate is better than conservative in Synopsys. 

• Simulate at the board level as soon as possible. 

• Simulate timing between blocks as soon as possible 
(schematic simulations are fairly accurate). 

• Simulate the chip coming up in random states as soon as 
possible. A proven way to do this is to make sure the chip 
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can come up with unknown values in all memory elements, 
including flip-flops and registers. 

The highest -priority design goal was to have a working 
IMACC system to demonstrate at an upcoming conference. 
We created a schedule consistent with this goal incorporat- 
ing the necessary checkpoints. Two of the most important 
checkpoints: 

• We were to deliver a top-level, schematic-based Verilog gate 
model to the customer so they could begin regression tests 
at the system level. This allowed them to identify design 
problems early. 

• The customer was to freeze the function of major data path 
blocks by a scheduled date. This allowed us to construct the 
artwork in a single pass. 

Setting a rigorous schedule as the first priority forced a 
streamlined design. Using a single clock domain made the 
design less complicated. A large data path block with non- 
critical functionality was eliminated because it would 
require too much design time. The die size was determined 
early and a previously characterized package was used. 
Timing budgets were kept conservative to ensure that 
IMACC would ran at 45 MHz after parasitic loading was 
added. 

Since all decisions were based on meeting our primary ob- 
jective, "creeping featurism" was eliminated. The customer 
was informed of the schedule impact that the addition of a 
new function implied. In most cases they were not willing to 
suffer a postponement of the chip release date. 

We also shortened the design time by making sure that when 
we sent them a new gate model of the chip, we had all the 
latest changes and we had no problems with the syntax of 
the model. To do this we used several tools (awsim, eval, etc.) 
to check for connectivity problems. We used an in-house 
history management system to maintain revision control of 
these gate models. 

Custom Multiplier 

One of the key pieces of circuitry on IMACC in terms of 
design leverage was an integer multiplier. The imaging algo- 
rithms that w e implemented typically executed a large sum 
of product terms. As an example, the convolver block in 
IMACC sweeps a 3 x 3 matrix across the source image (a 2D 
array of pixels), multiplies the nine coefficients in the mat rix 
by the corresponding pixels, and adds these nine products 
to compute a single new pixel value for the target image. 
Therefore, this block alone required nine multipliers and 
eight adders. With the customer's help w-e found that we 
could consolidate most of the multipliers into one design. 
The result is that in the IMACC design, a single 18 x 18 inte- 
ger multiplier circuit is used 87 times. 

It then became very important to make this multiplier as 
dease as possible. This problem was attacked on two levels. 
First, an area-efficient architecture was chosen, and secondly, 
the key multiplier cell was painstakingly designed and laid 
out to reduce its area as much as possible. 

The architecture we chose was a radix4 (two bits at a lime) 
Booth-encoded array. 1 A standard multiplier array in our 
case consisted of 18 rows, each row representing the result 
of one of the multiplier bits limes the whole multiplicand 
value Since the Booth-encoded array looks at two bits of 



the multiplier times the whole multiplicand in each step, this 
cut the number of row-s in half to rune. It turned out that the 
increase in the row height resulting from a more complex 
unit cell was well below the height of two of the standard 
rows. (How ever, there is additional circuitry that needs to 
be added to perform the Booth encoding of all of the sets of 
two multiplier bits. ) As a side benefit, cutting the multiplier 
rows in half also increased the circuit's speed. 

This speed-up benefit helped with our second task of reduc- 
ing the area of the critical multiplier cell. Our design require- 
ment was to execute the 18 x 18 multiply in a single clock 
state (22 ns). When the circuit was initially built, it executed 
the multiply in about two thirds of the required time (about 
14 ns). Therefore, it was possible to shrink the critical oeO 
by reducing the FET sizes until the cell delay increased the 
multiply time to the 22-ns limit. Along with some diligent 
artwork layout, this produced an extremely dense cell. 

Our final 18 x 18 Booth-encoded multiplier is 864.0 um wide 
by 307.S um high for a total area of 0.266 mm 2 . Using the 
standard multiplier array, this multiplier would have been 
648.0 um wide by 446.7 um high for a total area of 0.28! I mm-. 
Considering just the areas of the multipliers, the savings per 
mull iplier is 0.023 mm- and the total savings for IMACC is 
2.0 mm 2 . However, the aspect ratio of the standard multiplier 
did not lit well with the IMACC data path blocks (18 bits 
wide versus 24 bits wide). Taking into consideration the 
empty spaces that the standard multiplier would have pro- 
duced, the savings per multiplier increases to 0.123 mm 2 and 
the total savings for IMACC increases to 10.7 mm 2 . 

The custom multiplier design cut schedule lime for two 
reasons. Using it 87 times in numerous blocks across the 
chip saved us the time we would have spent assembling 
(and possibly designing) multipliers for each specific need 
in the different blocks. Secondly, the significanl area savings 
we realized made the job of lop-level chip routing much 
easier and consequently faster, since the top-level blocks 
were smaller. 

Data Path Compiler 

We were able to save many hours of ail work layoul lime 
through the use of Dpcl4, 2 a data path artwork generation 
tool that places and routes data path blocks. A data path is 
made up of hierarchical horizontal slices, often called macro 
cells, that are, usually, but not always, bit-wise logic repealed 
as many times as needed. Examples of macro cells are a two- 
input. 32-bit register, an 18-bit ripple carry adder, another 
data path block, or a custom data path block The program 
dpc_tiler was used lo build many of these macro cells. Macro 
cells can be placed vertically or horizontally with respect to 
one another. Routing is done over and between macro cells 
to conned signals within the data path. The user has the 
ability to control placement and how signals are routed. The 
user can assign signals to specific data path tracks or defer 
routing of signals such that the allow layers for the signals 
(layers that can be used later in the design to route signals) 
are placed on unused tracks. 

To use Dpcl4. we first generated a Dpcll input file, which 
was created from the schematic Block Description lan- 
guage (BDL) file. The Dpcl l input file could also be gener- 
ated from a Verilog netlist file. The tool bdl2dp was used to 
generate the input file from the schematic BDL foi most of 
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the IMACC data path blocks that used the schematic BDL. 
A simple input file is shown in Fig. L 

Artwork encapsulation information needed to be extracted 
for each macro cell used in the data path. The Perl script 
dpEncaplnfo was used to extract information about how to 
connect to ports within the macro cell. It was necessary to 
specify to dpEncaplnfo any wart cells on the left or right side 
of the macro cell. For instance, a typical DPLIB14 register has 
two wart cells on the right side of the macro cell. In this 
case we used dpEncaplnfo -r 2 to build the encapjnfo file 
correctly, Tliis specified that there were two wart cells on 
the right side of the register macro cell. 

After the Dpcl4 input file was created and encapsulation 
information had been extracted, the Dpcl4 program was 
used to generate an artwork archive that could be read into 
our artwork editor. The Dpcl4 file could then be edited if 
the artwork needed to be modified, allowing us to make as 
many iterations as needed to produce the desired result. 



-block dpcl 
-trackassign a 5 
-dpwidth 24 
-newrow { 

-inst HOLE TOP 24 0 { 

-dpsig A[23:0] a[23:0] 0 

-dpsig B[23:0] b[23:0] 0 

) 

} 

-newrow { 

-inst REG224 rega 24 0 { 

-dpsig INA[23:0] a[23:0] 0 
-dpsig INB[23:0] b[ll:0] 0 
-dpsig OtJT[23:0] out[23:0] 0 -metall 

} 



-newrow { 

-chanroute { -signal cinput[23:0] -right ) 

-inst MUX224 muxa 24 0 { 

-dpsig INA[23:0] a [23:0] 0 

-dpsig INB[23:0] out [23:0] 0 -metall 

-dpsig INC [23:0] cinput[23:0] 0 

-dpsig OOT[23:0] muxa[23:0] 0 

-cntlsig SELA sela 0 

-cntlsig SELB selb 0 

-cntlsig SELC selc 0 



} 



) 



-newrow { 

-inst HOLE BOTTOM 24 0 ( 

-dpsig muxa [23:0] muxa [23:0] 0 

} 



Fig. 1. A simple input file for the Dpcl 4 data path artwork 
generation tool. 



Local Toolbox 

A number of relatively simple scripts and programs of our 
own devising were combined into a local toolbox for the 
project. The more significant of these are described here. 

The mkcntl script uses Synopsys, block place-and-route. and 
ot her tools to go from a Synopsys netlist through schematic 
to artwork with parasitics in one command. Of course, one 
must iterate on the place-and-route portion to ensure work- 
able size, form factor, and port locations. An iteration can be 
accomplished with mkcntl -b. 

We used a connectivity tracer that reads schematic ( scip) 
BDL and reports the connectivity of the specified instance 
or net. The trace is limited to one level of hierarchy. For net 
names, regular expression pattern matching is available. 

We automated a lot of the top-level power connection of 
IMACC using two scripts. Several steps were involved in this 
methodology. First, two symbolic layers were used, one for 
VDD and one for GN0. to define where the metal-4 power 
buses would go. Next, the getpwrconn script used trantor to 
find the intersection of the metal-3 power buses with the 
symbolic layers and dumped them into an artwork editor 
archive file. Last, the gen_pg_conn script placed contact-4 
contacts in the intersection areas and filled the symbolic 
layers with metal 4, 

Our addallow script selects VDD, GND. and CLK metal-3 shapes 
by size and copies them to metal3. allow and contact4.allow 
layers, permitting connection to over-the-cell metal 4. 

There are two shieldmaker scripts in our toolbox: addshield2 
and addshield3. These scripts fill the unused areas of the inter- 
mediate (cross-channel) metal layer in routing channels. 
These areas are then tied to GND or VDD to provide crosstalk 
shielding for signals running the length of the channel. 

Diode placing software was used to eliminate large numbers 
of charge collectors. This software finds traces longer than 
1000 um in a routing channel and locates areas where diodes 
can be added without introducing design rule check errors. 
This technique worked well for our project, which had a 
tight schedule, lots of charge collectors, and minimal timing 
problems. 

All of the scripts that add shapes to a source block do so 
through intermediary block pieces to ease modification or 
rebuilding of the added function. 

Results 

The IMACC chip was demonstrated in systems at the Radio- 
logical Society of North America conference in Chicago on 
November 27. 1995. 

The IMACC chip contains 1.7 million FETs in the HP 
BiCMOS14QC process operating at a 45-MHz clock frequency. 
It is predominantly a data path design with 98 integer multi- 
pliers performing 4 billion integer multiplies per second on 
lS-bit or larger operands. A breakdown by design style is as 
follows: 
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Style 

Data Path 

Standard 
Cell 

RAM. 
FIFO 

Pads, 

Clock. 

etc. 



Percent of 
Total FET 
Count 

60% 

10% 



21% 
9% 



Number of 

Standard Cell Thousands of 

Gate Equivalents FETs per mm 2 

497.000 22.0 

53.000 10.8 



(>48K bytes) 



42.7 



Some additional statistics for the IMACC project include: 
2342 FETs/day. 8673 FETs/mm 2 . and more than (560,006 
standard cell equivalent gates. 
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In 1992 he was awarded his second MS degree in 
engineering science, specializing in graphical user 
interface design, at Simon Fraser University ISFUI 
in Canada. He participated in the Intelligent Graphic 
Interface project at SFU. where he developed a 
variable zoom technique to improve user navigation 
among telecommunication networks. He also devel- 
oped computer training software to help English- 
speaking people learn Chinese Richard is married 
and enjoys playing badminton and traveling. 

59 00 Network Management 
Development 

Peter E. Mellquist 

A software design engineer 
at HP's Workgroup Network 
Division since 1994, Peter 
Mellquist designs and builds 
core technologies for net- 
work management He is 
currently building Web- 
based network management 
protocols and solutions. He 
received a BSCS degree in 1988 and an MSCS degree 
in 1992, both from California State University at 
Chico. Before joining HP. he worked at Bently Nevada 
Corporation, where he designed and developed data 
acquisition protocols In his spare time. Peter enjoys 
fly fishing, traveling, and gardening. 
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Thomas Murray 

Tom Murray is a software 
engineer at HP's Telecom 
Platform Division and is cur- 
rently working on network 
management solutions for 
the broadband internet de- 
livery system. Previously he 
worked on network system 
management strategies for 
HP's multicomputer clustering technologies and pro- 
vided tracing and logging support services for HP-UX 
workstations. Tom received a BA degree in 1987 from 
the State University of New York ISUNY] at Potsdam. 
After graduating he joined HP as a software quality 
engineer designing test systems for network services 
and products. Motorcycling and traveling are two of 
his free-time activities. 

66 Vector Network Analyzer 



Frank K. David 

frank David is an R&D proj- 
ect manager at HP's Micro- 
wave Instrument Division 
and is responsible for the 
next -generation vector net- 
work analyzers. He joined 
HP in 1969 as a microwave 
design engineer. Since 1981 
he has managed projects 
involving numerous HP products, including the HP 
1 1 970 Series millimeter-wave mixers, the 1 1 975 LO 
amplifier, the HP 85070A materials measurement 
probe, the HP 8753D/52C RF vector network analyz- 
ers, and the HP 8720A/C/D microwave vector net- 
work analyzers He is a coauthor of a textbook called 
Electronic Instrument Handbook. He is named as a 
coinventor in two patents involving millimeter-wave 
signals He received a BSEE degree in 1965 from the 
University of California at Berkeley and an MSEE in 
1975 from Oregon State University. Before coming to 
HP, he worked for the State of California Department 
of Water Resources designing control circuits for 
pumping plant motors Born in Sacramento, frank is 
married and has two children. He actually enjoys 
doing home and auto maintenance in his free time 

Frederic W. Woodhull II 

A development engineer at 
HP's Microwave Instrument 
Division, fred Woodhull has 
worked as a design engineer 
and project leader for over 
five years on HP's network 
and spectrum analyzers. 
He joined HP in 1968 as an 
instrument repair technician 
and later served as a production line supervisor and a 
production engineer As a designer, his projects in- 
clude hardware and/or firmware design for the HP 
8805A. 8754A, 8757C/D, and 8753D network analyzers 
and project leadership for the hardware and firmware 
design of the HP 8757A network analyzer He did the 
hardware and firmware design for the color LCD In 
the HP 8720D vector network analyzer, and is cur- 
rently developing a central processor for a family of 
netwuik analyzers. Fred is named as a coinventor in 
a patent on a circuit to count the marker frequency of 
a swept network analyzer Born in Ross, California, 
he served three years in the U S Army as a specialist 
fifth class. He has a teenage son who likes to join 
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him on camping and watersknng trips Fred also enjoys 
working in his home workshop, perhaps recalling his 
adventures from the 1960s when he sailed around 
the world in an 85-foot motor-sailer 

Richard R. Barg 

Dick Barg is a product de- 
sign engineer at HP's Micro- 
wave Instrument Division. 
He has worked on product 
design and mechanical engi- 
neering since joining HP's 
— " JMV Rockaway Division in 1966. 

and has contributed to 
numerous products including 
the HP 8699A RF unit, the HP B605A sweep oscillator, 
the HP 8746B test set. and the HP B753B/C/D, 8754A, 
HP B755A, and HP8719/20/22A/B/C/D network 
analyzers Dick received a BSEE degree in 1964 from 
Fairleigh Dickinson University He was born in Jersey 
City. New Jersey, and spent two years in the U.S 
Army. He has been married for 39 years and has two 
children. Most of his free lime is spent traveling the 
world. He has visited all of the continents on earth 
except Antarctica at least once. 

Joel P. Dunsmore 

A development engineer at 
HP's Microwave Instrument 
Division. Joel Dunsmore 
works as an RF and micro- 
wave applications specialist. 
He did microwave switch 
and amplifier/detector de- 
sign and ALC design for the 
HP 8720D vector network 
analyzer. Before that, he did RF source, receiver, and 
test set design for the HP B753A/D network analyzers, 
the HP 83420 lightwave test set. and the HP 85047 
RF test set. Professionally interested in RF and micro- 
wave measurement techniques, he has authored 
numerous articles and taught a course on these 
subjects, and consults in the U.S., Europe, and Japan 
He is named as an inventor in three patents. He is a 
member of ihe Automatic RF Techniques Group and 
the Society of Cable Telecommunications Engineers. 
Bom in Oregon. Joel was awarded a BSEE degree in 
1982 and an MSEE degree in 1983 from Oregon State 
University. After graduating he |oined HP and worked 
as an RF and microwave development engineer Joel 
is married and has two children. He is a local Cub 
Scout leader and his hobbies include woodworking 
and gardening. He also enjoys skiing and recently 
started building model railroads with his son. 

Douglas C. Bender 

Doug Bender is a software 
development engineer for 
RF and microwave network 
analyzers at HP's Microwave 
Instrument Division He has 
done software development 
fortheHP8757Aand 
8720A/B/C network analyz- 
ers and served as the firm- 
ware proiect manager for numerous projects including 
the HP 83750 synthesized sweeper. HP 871 xB RF net- 
work analyzers, the HP 8753D/52C RF network ana- 
lyzers, and the HP 872x0 vector network analyzers 
Doug received a BS degree in 1 973 from the U.S. 
Military Academy and an MSEE degree in 1980 from 
Stanford University. He then joined HP's Santa Rosa 
Division as a product marketing engineer responsible 





for scalar and vector network analyzers. He has 
authored two articles on the scalar analyzer system 
and on microwave network measurement techniques. 
Born in Lincoln, Nebraska, Doug is married, and has 
a daughter. Amateur radio is one of his interests and 
his call sign isWWBD. 

Barry A. Brown 

An engineer/scientist at HP's 
Microwave Instrument Divi- 
sion. Barry Brown is cur- 
rently responsible for data 
acquisition and DSP hard- 
ware and software design 
for network analyzers. He 
designed the IF multiplexing 
circuit and other miscella- 
neous circuits for the HP 8720D vector network ana- 
lyzer Previously he was a development engineer for 
the HP 851 OA network analyzer, project leader for a 
low-cost RF tracking generator, and development 
engineer for the HP 8753A/B/D network analyzers. 
He worked on data acquisition, IF processing, fre- 
quency synthesis, digital control, state machine, and 
CPU design. Professionally interested in digital signal 
processing, he has authored or coauthored three ar- 
ticles and four conference papers about network 
analysis and is named as an inventor in a patent con- 
cerning nonlinear network analyzer hardware Born 
in Indiana, he received a BSEE degree in 1976 from 
Purdue University and an MSEE degree in 1979 from 
Stanford University. His outside interests include hik- 
ing, camping, Windows programming, and fine So- 
noma County wines. 

Stanley E. Jaffe 

^^^^ Stan Jaffe joined HP's Santa 

f~ ^ Rosa Division in 1980. He is 
currently a development 
an i engineer at HP's Lightwave 
Division and is responsible 
for the firmware and digital 
architectural design for the 
next-generation lightwave 
communications test instru- 
ments. Since joining HP, he has contributed to a num- 
ber of products including the HP 70902/3 modular 
spectrum analyzer IF modules, the HP 8752/3 RF net- 
work analyzers, and the HP 8719/20/22 microwave 
network analyzers. He did the source frequency 
control design and firmware for the HP 8720D vector 
network analyzer. He is named as an inventor in a 
patent on a phase-locking circuit and has authored 
two articles on thermistors. He is professionally inter- 
ested in real-time control firmware and Windows 
programming in C++. Born in Downey, California, 
he received a BSECE degree in 1980 from the Univer- 
sity of California at Santa Barbara and an MSCE in 
1993 from National Technological University Stan is 
married and has one cat. He is a board member at a 
local library and his hobbies include home remodel- 
ing, woodworking, guitar, piano, mountain biking, and 
computers. 





78 Interconnect Geometry Optimization 



Khalid Rahmat 

Khalid Rahmat received BS 
and MS degrees in 1985 and 
1987 in electrical engineer- 
I «w-y ftKM mg, and a PhD in electrical 

HH^ engineering and computer 

*~ science in 1995, all from the 

Massachusetts Institute of 
Technology. Later in 1995. 
he became a member of the 
technical staff at HP Laboratories in the ULSI Labora- 
tory He is currently doing research in semiconductor 
device and interconnect design and optimization for 
high-performance integrated circuits Professionally 
interested in these subjects, he has authored several 
articles and conference proceedings on his work. 
Before coming to HP, he worked at Standard Micro- 
systems Corporation doing semiconductor device 
characterization and modeling. 

Soo-Voung Oh 

Soo-Young Oh received his 
BSEE degree from Seoul 
National University in 1972, 
and his MSEE and PhD de- 
grees in electrical engineer- 
ing in 1977 and 1980 from 
Stanford University. He then 
joined HP and is currently 
the manager of a group in 
the ULSI laboratory at HP Laboratories, working on 
process and device simulations, statistical metrology, 
and interconnect characterization and modeling. His 
work has resulted in thirty-one publications, and he 
has coauthored a book titled Computer-Aided Design 
and VLSI Device Development, published by Kluwer 
Academic Publishers in 1986 and 1988 




84 Phase-Locked Loop 



Thomas J. Thatcher 




Tom Thatcher is a design 
engineer at HP's Integrated 
Circuit Business Division. 
Since 1992 he has worked 
on the HP color LaserJet 
and the HP LaserJet 5P and 
5MP printers. He is currently 
involved with the superin- 
tegration of microprocessors 
on ASICs. For the phase-locked loop project reported 
in this issue, he worked on HDL coding and circuit 
design, modeling and simulation, and test develop- 
ment He received a BSEE degree in 1988 from 
Seattle University and an MSEE degree in 1992 from 
the University of Illinois at Urbana-Champaign. 
Before coming to HP he worked at Seattle Silicon 
Corporation as a design engineer working on soft- 
ware testing of CAD tools, circuit design, and test 
program development He is professionally interested 
in circuit simulation and test design. In his free time 
he enjoys playing the piano 
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Michael M Oshima 

^^^^^ ft circuit design engineer in 

atHPslnte: 
Wf. -^^B Business Division, Mike 

>^*--_ Before that he designed 

high-speed bipolar ICs Mike 
is professionally interested m analog circuit design 
He was awarded a BSEE degree in 1983 and an MSEE 
degree in 1387. both with concentrations in semicon- 
auctor devices, from the University of Hawaii After 
graduating, he joined the HP Santa Clara Technical 
Center as a new product introduction engineer, sup- 
porting the release of custom bipolar integrated cir- 
cuits from development to manufacturing Mike is 
married. In his free time he enjoys tennis, skiing, and 
growing orchids 

Cindy Botelho 

Cindy Botelho is a design 
engineer at HP's Integrated 
Circuit Business Division 
IICBDI. She initially worked 
in the area of failure analy- 
sis, then moved to R&D, 
where she worked on six 
ASIC designs for HP Laser- 
Jet Series 4 through 6 print- 
ers. She was the design liaison for the ASIC reported 
on in this issue, and is currently a program manager 
overseeing relationships with third-party vendors. 
She received an MSEE degree in 1990 from Oregon 
State University. In her free time, Cindy enjoys wind- 
surfing, gardening, and singing in a garage band 
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BenB. Sheng 

Ben Sheng was a mixed-signal design engineer at 
HP's Integrated Circuii Business Division until he left 
HP in October 1996 For the project reported in this 
issue, he was responsible for system-level modeling 
and simulation, design process and tools support, 
and digital and analog circuit design Before that he 
worked as a physical design engineer on ASICs, pro- 
viding back-end verification services and support He 
is a member of the IEEE. Ben received a BSEE degree 
from the University of California at Berkeley in 1992 
He was born in Beijing, China 

HughS.C. Wallace 




Born in Eaglesham. Scot- 
land, Hugh Wallace was 
awarded a BS degree in 
engineering science with 
honors from Edinburgh Uni- 
versity After graduating he 
joined HP's Queensferry 
Telecom Division. He is cur- 
rently a CMOS integrated 
circuit design engineer doing digital design for an 
IEEE 8802.3 lOBase-T LAN MAU Recently he worked 
on block design for the project reported in this issue 
and before lhat on IC design for various chips He 
also designed 3-Gbit/s bipolar MUX and DEMUX ICs 
Hugh is married and has two daughters He teaches 
basic Logo programming at a local elementary school 
and serves on the school's technology committee. In 




his free time he enjoys building furniture using hard- 
woods. 

James S. Ignowski 

Jim Ignowski is a member of 
the Fort Collins technical 
staff of HP's Integrated Cir- 
cuit Business Division He 
currently does digital and 
analog CMOS design for 
VLSI ICs and recently did 
block design for the project 
reported in this issue. Before 
that he did analog and digital design for various ICs 
in such areas as IC test, graphics, read channels. 
CPUs, and computer systems He has authored or 
coauthored several papers for HP's Design Technology 
Conferences (DTCs). Jim is a member of the IEEE and 
sits on a DTC technical committee Born in Sioux 
Falls. South Dakota, he received a BSEE degree in 
1984 from Rice University and an MSEE in 1988 from 
Stanford University He joined HP's Optical Commu- 
nications Division in 1984 and did bipolar and CMOS 
analog design for optocoupler products. He is named 
as an inventor in a patent involving circuits for realiz- 
ing an optical isolator. Jim is married and has two 
children He is the commissioner of the HP Fort Collins 
softball league. His activities include sports such as 
Softball, basketball, and football. He also enjoys 
music and outdoor recreation such as hiking, camp- 
ing, and skiing 

95 0.35-1 im Gate Array Design 



Lionel C. Bening 

An engineer/scientist at HP's 
Convex Division since 1989. 
Lionel Bening was recently 
responsible for the Verilog 
design language style deci- 
sions for the HP Exemplar 
S-class and X-class technical 
servers Since joining HP, he 
has developed tools and 
methods for speeding up logic simulations for the 
Convex C3 and C4 compute' designs. He is currently 
responsible for the future development of the parallel 
processing server design language and for develop- 
ment of simulation CAD tools He has authored or 
coauthored papers on logic simulation and timing 
verification for several technical conferences and is a 
member of the IEEE Computer Society, the ACM, and 
the ACM Special Interest Group on Design Automa- 
tion. He was awarded a BSEE degree from the Uni- 
versity of Minnesota in 1962 He then worked at Con- 
trol Data Corporation and was an ad|unct instructor 
at the University of Minnesota Lionel is married and 
has two daughters. He enioys playing touch football, 
singing in a local chorus, and enhancing his web site, 
which is available at http://ourworld compuserve.com 
/homepages/ben mg 






Tony M. Brewer 

Tony Brewer is a system 
architect at HP's Convex 
Drvision ana is responsible 
for hardware design and 
\ -; j i computer architecture He is 
J 

evaluation of system en- 
hancements for the HP SPP 
2000 system He is a mem- 
ber of the IEEE and wrote and presented a paper at 
the 1995 CompCon conference He earned a BSEE 
degree in 1980 and an MSEE degree in 1982. both 
from Purdue University. 

Harry D. Foster 

Born in Bethesda. Maryland. 
Harry Foster was awarded a 
BSEE degree in 1983 from 
the University of Maryland 
and an MSCS degree in 
1989 from the University of 
Texas He then joined HP's 
Convex Division as a engi- 
neer/scientist and has been 
responsible for CAD tool research and development 
for the Convex C3400 and C4 computers. Recently he 
worked on the HP Exemplar S-class and X-class tech- 
nical servers and was responsible for the formal veri- 
fication, resynthesis, and clock tree builder. He also 
contributed to the postsilicon design verification. 
Before joining HP, he worked at Texas Instruments' 
Defense Systems Group as a board and ASIC design 
engineer In his free time. Harry loves to travel, lift 
weights, and inline-skate. 

Jeffrey S. Quigley 

Jeff Quigley jained HP's Con- 
vex Division in 1986 and has 
worked on CAD tool sand 
processes for the Convex C2, 
C3. C4 and Exemplar S-class 
and X-class technical serv- 
ers. He is currently the de- 
sign automation manager for 
the Exemplar servers Jeff is 
a member of the IEEE. He received a BSEE degree in 
1980 from the University of Kansas and an MBA in 
1984 from Southern Methodist University Before 
coming to HP he worked at Texas Instruments in the 
Defense Electronics Group in various design and CAD 
positions. Jeff was born in Omaha. Nebraska He is 
married and has a daughter and son He enioys out- 
door sports such as running and swimming. 

Robert A. Sussman 

Bob Sussman has been 
a design engineer at HP's 
Convex Division since 1986 
and was responsible for the 
development of ASIC CAD 
tools 'or the HP Exemplar 
S-class and X-class techni- 
cal servers Bob earned a 
BSEE degree in 1984 from 
the University of Virginia He continues to work on 
ASIC CAD tools. 
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Paul F. Vogel 

Born in Peoria. Illinois. Paul 
Vogel received a BSEE de- 
gree in 1983 from the Uni- 
versity of Illinois, A design 
engineer at HP's Convex 
Division, he has worked on 
ASIC CAD tool development, 
including the latch array 
compiler, floor plan tool, 
and postroute analysis for the HP Exemplar S-class 
and X-class servers Previously, he was an ASIC 
design engineer He is currently doing postsilicon 
design verification. Before joining HP. he worked 
at Mostek as a microprocessor product engineer and 
at Honeywell as a VHSIC product engineer. 

Aaron W Wells 

A CAD engineer at HP's Con- 
vex Division, Aaron Wells 
provided the primary place 
and route support for the HP 
SPP 2000 computer and is 
currently responsible for 
physical design strategies 
for HP SPP 2000 follow-ons. 
Aaron received a BSEE de- 
gree in 1984 from the Georgia Institute of Technology. 
Before coming to HP, he worked at Harris Corporation 
as a CAD tools specialist in the Computer Systems 
Division. He joined HP in 1989 as an ASIC physical 
design specialist. Aaron was born in Cynthiana, 
Kentucky 

104 Fast Chip Turnaround 




Rory L. Fisher 



Rory Fisher earned a BSEE 
degree in 1991 and an MSEE 
degree in 1994, both from 
Brigham Young University. 
After graduating, he joined 
the product design group at 
HP's Integrated Circuit Busi- 
ness Division. As a hard- 
ware design engineer, he is 



currently designing a memory interface chip for a 
high-performance file server He recently worked on 
the convolver circuit, bus interface, and test method- 
ology for the HP IMACC chip He has also worked on 
a texture mapping chip and an image enhancing chip 
for medical applications. A member of the IEEE, Rory 
is interested m high-speed data transfer and reliable 
test strategies. Before coming to HP, he worked at 
IBM in Burlington, Vermont, developing an Ethernet 
and token ring chip and a graphics coprocessor. Born 
in San Jose. California, Rory is married and enjoys 
racquetball, snowboarding, watersknng. and cycling 
in Rocky Mountain National Park 

Stephen R. Herbener 

A hardware engineer at HP's 
Integrated Circuit Business 
Division, Steve Herbener is 
responsible for VLSI hard- 
ware and CAD tool design 
He has worked on ICs for 
disk drives and hardware 
graphics accelerators for HP 
9000 Series 300 through 700 
workstations. He did VLSI hardware design for the HP 
IMACC chip Steve is a member of the IEEE. He was 
awarded a BS degree in computer science in 1982 
from the University of Nebraska at Lincoln and an 
MSEE degree in 1985 from the University of Illinois 
at Urbana, Champaign. Before joining HP in 1986, 
he worked as a hardware engineer at Data General, 
designing a PC XT clone. Born in Peoria, Illinois, 
Steve is married and has one son In his free time, 
he serves as a Boy Scout leader and loves spending 
lime in the Colorado outdoors hiking, skiing, camping, 
and river rafting. He is passionate about music, plays 
trumpet, and is a member of HP's jazz band, the 
Greeley Chamber Orchestra, and the Northern 
Colorado Concert band. He also composes music 
on his electronic synthesizer 





John R. Morgan 

John Morgan is a hardware 
design engineer at HP's Inte- 
grated Circuit Business Divi- 
sion and is currently working 
on the design of graphics 
chips. He did block design 
and custom tool develop 
mem (or the HP IMACC chip 
With HP since 1976, he pre- 
viously designed CAD tools, including a reticle gener- 
ator, worked at the HP Loveland Technical Center 
developing and supporting ICCAD tools, and did pio- 
neering work on HP's hierarchical IC design method- 
ology at HP's Colorado Springs Technical Center He 
was awarded a BSEE degree in 1979 from the Univer- 
sity of Colorado at Colorado Springs. John is married 
and has two daughters. He spent three years in the 
U S Army attending 0CS and serving in Germany as 
an artillery lieutenant. He is active in his church and 
is chairman of the administrative board. In his free 
time, he enjoys singing opera, woodworking, and 
designing loudspeakers 

John R, Pessetto 

^gtg^ Bom in Salt Lake City. Utah, 
jf^^BB^ J° nn Pessetto received a 

k> ■ BSEE degree in 1980 and an 

W 1 ^ MSEE degree in 1982, both 

7 from the University of Utah 

After graduating, he spent 
seven years at IBM doing 
DRAM and SCRAM design 
During that time he co- 
authored two papers on his work and was named as 
an inventor in a patent involving DRAM memory cells. 
In 1989 he joined HP's Integrated Circuit Business 
Division as a hardware design engineer and has been 
designing graphics display and accelerator chips. He 
designed the interpolator for the HP IMACC chip and 
is currently designing an interpolator and span-step- 
per for a new graphics chip John is a member of the 
IEEE. He is married, has three children, and coaches 
basketball and baseball for a local youth league. His 
hobbies and challenges include investing in his stereo 
system, attempting to knock a Softball out of the ball 
park, and teaching his daughter about boys. 
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